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Preface 



Galois connections provide the order- or structure-preserving passage 
between two worlds of our imagination - and thus are inherent in hu- 
man thinking wherever logical or mathematical reasoning about cer- 
tain hierarchical structures is involved. Order-theoretically, a Galois 
connection is given simply by two opposite order-inverting (or order- 
preserving) maps whose composition yields two closure operations (or 
one closure and one kernel operation in the order-preserving case). 
Thus, the “hierarchies” in the two opposite worlds are reversed or 
transported when passing to the other world, and going forth and back 
becomes a stationary process when iterated. The advantage of such an 
“adjoint situation” is that information about objects and relationships 
in one of the two worlds may be used to gain new information about 
the other world, and vice versa. In classical Galois theory, for instance, 
properties of permutation groups are used to study field extensions. 
Or, in algebraic geometry, a good knowledge of polynomial rings gives 
insight into the structure of curves, surfaces and other algebraic vari- 
eties, and conversely. Moreover, restriction to the “Galois-closed” or 
“Galois-open” objects (the fixed points of the composite maps) leads 
to a precise “duality between two maximal subworlds” . 

Though the name “Galois connections” refers to the classical example 
of Galois theory, where subfields of a field and subgroups of the au- 
tomorphism group are related by a dual lattice isomorphism, a richer 
supply of examples comes from certain relations of annihilation or 
orthogonality, well-known from various branches of algebra, geome- 
try and topology - and in fact, this was the starting point for Garrett 
Birkhoff when he invented in the thirties of the past century his general 
“polarities” as Galois connections between power sets, arising from re- 
lations between the ground sets. The more general order-theoretical 
type of Galois connections ( alias Galois connexions or Galois corre- 
spondences) as defined above is due to Oystein Ore (about 1940). 
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One or two decades later, the even more general notion of adjoint 
functors was discovered by workers in category theory, and various 
very useful types of categorical Galois correspondences were subse- 
quently invented. 

Concerning the position of Galois connections within mathematics, it 
has to be admitted that deep theorems rarely are immediate conse- 
quences of the general theory of Galois connections, but usually require 
some extra tools and ideas stemming from the specific theory under 
consideration. But the general framework, supporting the intuition 
and suggesting the appropriate concepts, is established by the discov- 
ery of the “right” underlying Galois connection, followed by a good 
characterization of the “Galois-closed” (or “Galois-open” ) elements or 
sets. And there is no doubt that many proofs become shorter, more 
elegant and more transparent in the language of Galois connections. 

The aim of this book is to present not only the main ideas of gen- 
eral Galois theory, of concepts related to Galois connections, and of 
their appearance in various classical and modern areas of mathemat- 
ics, but also to show how they can be applied in other disciplines. The 
book consist of 15 contributions, written by authors who are special- 
ists in diverse fields of mathematics and who use Galois connections 
in their work. Although the majority of the contributions are dealing 
with themes of classical and universal algebra, the reader will also find 
many logical, order-theoretical, topological and categorical aspects - 
and, last but not least, several practical applications, sometimes even 
in extra-mathematical fields like linguistics, digital representation of 
graphics, or data analysis. In fact, the methods of modern formal 
concept analysis, based on polarities arising from certain relations be- 
tween objects and attributes, have applications in quite diverse non- 
mathematical areas of sciences and arts. Readers interested in that 
theory, its mathematical background and its applications, are referred 
to the monograph Formal Concept Analysis - Mathematical Founda- 
tions by B. Ganter and R. Wille (Springer- Verlag, 1999). 

In the first contribution, which is also aimed at “novices” in the the- 
ory of Galois connections, M. Erne presents the basic ideas and con- 
cepts: order-theoretical Galois connections in their original contravari- 
ant form with antitone (order-inverting) maps and in their covariant 
form with isotone (order-preserving) maps, also referred to as adjunc- 
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tions or “mixed” Galois connections. The classical examples come 
from polarities and axialities, i.e. Galois connections and adjunctions, 
respectively, between power sets. A journey through three centuries 
of mathematics illustrates the historical development of the main con- 
cepts in that area. After a discussion of diverse examples in classical 
Galois theory and other parts of algebra, the pioneer achievements 
of the twentieth century are sketched, in particular, Birkhoff’s idea 
of polarities and Ore’s Galois connexions. The journey ends with a 
survey of residuation theory, where binary operations having adjoint 
translations are the central tool. More recent developments, in par- 
ticular, the categorical types of Galois correspondences and the even 
more general adjoint functors had to be omitted in this historical in- 
troduction, due to space limitations. Readers interested in these topics 
but not familiar with categorical thinking may consult, for example, 
the book by Adamek, Herrlich and Strecker: Abstract and Concrete 
Categories (J. Wiley & Sons, 1990). 

In Dyadic Mathematics - Abstractions from Logical Thought, R. Wille 
sets out the philosophical, logical and mathematical ideas furnishing 
the foundations of formal concept analysis, and hence of concrete ap- 
plications of Galois connections. He demonstrates how human logical 
reasoning is based on concepts and how these may be formalized math- 
ematically by forming contexts and their concept lattices. As the title 
indicates, the emphasis is on dyadic conceptions and results in order 
and lattice theory, contextual logic, algebra, and geometry. This sur- 
vey article is of interest not only for people looking for applications 
of Galois connections, but also for those who want to understand the 
logical and philosophical background. 

Three other contributions are devoted to some order- and lattice- 
theoretical aspects of Galois connections and their applications in 
other fields. 

In “Galois Connections and Complete Sublattices” by K. Denecke and 
S. L. Wismath, three different methods are described to characterize 
complete sublattices of a complete lattice, by using a conjugate pair 
of additive closure operators, by special closure or kernel operators 
defined on the given complete lattice, and by so-called Galois-closed 
subrelations. Basic tools here are contexts and concept lattices, known 
from formal concept analysis. 
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The frequently observed phenomenon that approximation properties 
of certain relations on ordered sets are closely related or even equiva- 
lent to certain (infinite) distributive laws for the given ordered struc- 
tures or some of their completions, is studied in “The Polarity between 
Approximation and Distribution ” by M. Erne. It turns out that the 
right framework for those investigations is a certain polarity coming 
from an elementary relation defined in terms of a fixed closure opera- 
tion on a frame ( alias locale, a pointfree abstraction of topology). Via 
that polarity, the Galois-closed approximating subsets are in one-to- 
one correspondence with the sublocales containing all closed elements, 
and the Galois-closed strongly approximating subsets correspond to 
the completely distributive sublocales. These results encompass many 
specific representation theorems in order theory and related fields such 
as domain theory and pointfree topology. 

Adjoint pairs of maps on the integers or natural numbers are studied 
by J. Lambek in “Iterated Galois Connections in Arithmetic and Lin- 
guistics”. He points out that such adjoint pairs simply correspond to 
infinite subsets with an infinite complement, and that for maps that 
are unbounded on both sides, all iterated left and right adjoints ex- 
ist. In the second part, some categorical generalizations lead to the 
notion of pregroups, which have interesting applications in a modern 
treatment of linguistic problems. In particular, they provide a new 
algebraic approach to the structure of sentences in natural languages. 

Often it is necessary to define Galois connections for proper classes 
rather than sets. In order to circumvent set-theoretical difficulties in 
that extended setting, one usually assumes the existence of a “com- 
mon universe”. One of the most important Galois connections of the 
class-theoretical type comes from the relation of satisfaction between 
algebras and equations, or more generally, between partial algebras 
and formulas. (The dual relation is the relation of validity.) This ap- 
proach provides fundamental connections between algebraic structures 
and the equations (or formulas) they satisfy. The resulting Birkhoff- 
Tarski isomorphism theorems then establish dualities between varieties 
and equational theories, or between more general classes like quasivari- 
eties and implicational theories, etc. Four papers here deal with these 
and some other polarities arising in universal algebra. 




Preface 



xi 



The survey paper “Galois Connections for Partial Algebras” by P. 
Burmeister deals with such polarities in the rather general context of 
many-sorted partial algebras, which are of still increasing importance 
not only in universal algebra but also in computer science. Since in 
that general realm there are several kinds of equations, one also has 
various corresponding relations of validity. Another polarity discussed 
in that paper comes from the relation between maps and formulas 
of a certain signature, saying that the map reflects the formula (or 
equivalently, preserves the negated formula). Here, the resulting po- 
larity yields several interesting factorization systems for morphisms, 
known from category theory and universal algebra. In the last part of 
Burmeister’s paper, the power of attribute exploration via formal con- 
cept analysis is demonstrated for the discovery of interdependencies 
between properties of homomorphisms. 

A standard algebraic approach to fragments of implicational or intu- 
itionistic logic are Hilbert algebras and their deductive systems (con- 
taining the constant “true” and closed under modus ponens). In De- 
ductive Systems and Galois Connections , I. Chajda and R. Halas in- 
vestigate a general algebra A with a constant 0 and its so-called 6- 
deductive systems, where b is a binary term function of the algebra. 
These are closed under certain inference rules and form a closure sys- 
tem that is isomorphic, via a suitable axiality, to the system of all con- 
gruences 0 satisfying a0c iff b(a, c)0O and 6(c, o)0 0 (6-regular con- 
gruences for short). The basic tools for applications to congruence the- 
ory are so-called Fichtner terms of a variety, i.e. binary terms such that 
every congruence of any algebra in the variety is 6-regular (it suffices 
to ensure that property for the identity relations). The 6-deductive 
systems of any algebra in a variety having a Fichtner term turn out to 
be precisely the congruence kernels. In the second part, the polarity 
induced by the relation R = {(6, D) : D is a 6-deductive system of A} 
between binary terms and subsets of an algebra is investigated, and 
its Galois-closed sets are characterized for certain cases. 

In “ Complexity of Terms and the Galois Connection Id - Mod”, K. 
Denecke and S. L. Wismath study the influence of term complexity 
on the Galois connection between identities and algebras. Calling a 
variety Abnormal when both sides of any of its nontrivial equations 
have complexity at least k, they extend several central results on nor- 
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mal varieties to the k- th level and illustrate the limitations of such 
extensions. 

In the survey “Q-Independence and Weak Automorphisms as Galois 
Connections” , K. Glazek and S. Niwczyk present two further kinds of 
Galois connections appearing in universal algebra. These are related 
to general notions of independence and of weak automorphisms. In 
the first case, the basic relation for the required polarity is defined 
between subsets and partial maps of a fixed algebra: a subset X is 
related to a partial map p if the latter extends to a homomorphism 
on the subalgebra generated by X whenever X is the domain of p. In 
the second case, the basic relation is defined between permutations of 
a fixed set and clones over that set, and it holds iff the permutation 
preserves the clone (in a natural way). 

Another fundamental type of Galois connection for universal algebra 
is a central theme in several papers. This is the Galois connection 
(Pol, Inv) arising from the invariance relation “/ preserves q ” between 
operations of an algebra A and n-ary relations on A. In that case, one 
says that / is a polymorphism of g, or that g is invariant under /. 

In Galois Connections for Operations and Relations, R. Poschel gives 
a systematic overview of the manifold Galois connections obtained by 
restriction or extension of the connection (Pol, Inv ) to specified or 
generalized ranges, respectively. The main theorems yield intrinsic 
characterizations of the Galois-closed sets and the corresponding clo- 
sure operators on both sides. The general (not necessarily finite) case 
requires certain local closure operators providing interpolation, while 
on the relational side, one has to consider so-called relational clones, 
for example, various kinds of Krasner algebras or clones, corresponding 
(via Galois connection) to classes of endomorphisms and (weak) auto- 
morphisms (different from those considered by Glazek and Niwczyk). 
Moreover, the (Pol, Inv) approach provides nice solutions of simulta- 
neous concrete characterization problems for subalgebra lattices, con- 
gruence lattices and automorphism groups. On the other hand, there 
are useful extensions of (Pol, Inv) to partial functions, multifunctions 
and heterogeneous functions (cf. the article by P. Burmeister). 

In Galois Connections and Polynomial Completeness, K. Kaarli ex- 
plores different generalizations and variations of primality and polyno- 
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mial completeness (meaning that all functions, or at least all relation- 
preserving functions, are polynomials). His approach makes system- 
atic use of the Galois connection ( Pol,Inv ) and demonstrates how 
and why local interpolation plays such a crucial role in the infinite 
case. Thus, for example, a local term function is one that can be in- 
terpolated by a term function on every finite subset, and an algebra is 
called locally primal if every function (in several variables) is a local 
term function. Variations of that concept are (local) congruence pri- 
mality, (local) endo- or automorphism primality etc. (cf. the survey 
article by R. Poschel). Special attention is paid to varieties having a 
majority or near-unanimity term, to order affine complete lattices, to 
algebras with distributive lattice reduct, and to endoprimal algebras. 

A. Szendrei in her contribution A Survey of Clones Closed Under Con- 
jugation studies important complete sublattices of the lattice of all 
clones of operations on a finite set, using a Galois connection that can 
be derived from ( Pol,Inv ). 

The last three papers are devoted to themes where categorical methods 
help to generalize and to improve classical results of topology and 
Galois theory. 

W. Gahler shows in his survey Galois Connections in Category The- 
ory, Topology and Logic how partially ordered monads (a categorical 
notion very close to adjoint situations) provide the right framework 
for a very general development of abstract categorical topology and 
convergence theory. The basic idea here is to suitably generalize the 
“classical” filter monad, which has proved basic for many questions 
of convergence theory in the past. But, as recent developments have 
shown, there is also a large overlap of such topological theories with 
non-classical logics and computer sciences (see, for example, the book 
by St. Vickers: Topology via Logic (Cambridge Tracts in Th. Comp. 
Sc. 5, Cambridge University Press, 1989). In the second part of this 
paper, various types of fuzzy filters in locales and quantales are stud- 
ied more thoroughly, and their interdependencies are illustrated by 
several diagrams. 

J. Slapal’s note A Galois Correspondence for Digital Topology de- 
scribes an adjoint situation between the category of (generalized) clo- 
sure spaces in the sense of Cech (where only extensivity and preser- 
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vation of inclusion and of emptiness is required) and the category of 
relational systems of a given (possibly infinite) arity. This extends the 
well-known binary case, leading to a concrete functorial isomorphism 
between partially ordered sets and Alexandroff (discrete) spaces. How- 
ever, the general case is much more involved. For the objects of the 
resulting coreflective subcategory of closure spaces, connectedness co- 
incides with a certain kind of “discrete path-connectedness”. There- 
fore, the study of such objects is helpful in digital topology, where 
these notions of connectedness play a crucial role. 

In the final paper, G. Janelidze gives a survey of Recent Developments 
in Categorical Galois Theory. This rather advanced paper requires 
some familiarity with categorical methods and concepts. For read- 
ers who are not experts in that area, some preparatory literature is 
recommended, for example the book Galois Theories by F. Borceux 
and G. Janelidze (Cambridge Studies in Adv. Math. 72, Cambridge 
University Press, 2001), where the development from classical Galois 
theory to the modern categorical versions is described more compre- 
hensively. In the present article, one subsection deals with categori- 
cal versions of A. R. Magid’s Galois theory for rings, and another is 
concerned with central extensions of universal algebras. But, a bit 
surprisingly, the theory begins with some purely topological consider- 
ations on categorical characterizations of connectedness and compo- 
nents. A Galois structure is then defined as a triple consisting of an 
adjoint pair of functors and two subclasses of objects in the involved 
categories, called fibrations, having suitable closure properties with re- 
spect to certain categorical constructions. Many examples are given, 
but most of them also require some categorical background. The fun- 
damental theorem of Galois theory (in a categorical form) is based 
on Grothendieck’s formulation saying that the opposite category of 
subextensions of the field extension E/K is equivalent to the category 
of transitive Gal(E / K)- sets. The chosen approach confirms that cat- 
egorical language provides many improvements and simplifications of 
classical results: besides various related phenomena, it explains why 
the Galois theory of fields and the theory of covering spaces are just 
identical, if interpreted the right way. 

The plan for a book on Galois connections originated with the partici- 
pants of a conference on that subject held at the University of Potsdam 
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in 2001. The editors hope that this volume, though certainly not com- 
prehensive enough to cover all the important themes in this area, will 
help to make an easy but beautiful concept more popular. Thus, if 
the value of Galois connections as a unifying concept in quite diverse 
branches of mathematics, but also its usefulness as a practical tool 
both in mathematics and other areas, become more evident by the 
contributions in this book, our main intention will have been fulfilled. 

* * * 



Klaus Denecke Marcel Erne Shelly L Wismath 
Potsdam Hannover Lethbridge 

October 2003 
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Abstract 

Galois connections build the abstract framework not only for classical and 
modern Galois theory involving groups, fields and rings, but also for many 
other algebraic, topological, order-theoretical, categorical and logical theo- 
ries. 

We sketch the development of Galois connections, both in their covariant 
form (adjunctions) and in the contravariant form (polarities) through the 
last three centuries and illustrate their importance by many examples. The 
main steps in the development are: 

the theory of polynomial equations (Lagrange, Galois), 

the modern Galois theory (Dedekind, Artin), 

the origins of lattice theory (Dedekind, Schroder), 

the polarities and lattice-theoretical aspects (Birkhoff), 

the order-theoretical Galois connections (Ore), 

the logical calculus (Boole, Peirce, Schroder), 

and the residuation theory (Krull, Ward, Dilworth). 

Besides sporadic occurrences of adjunctions and Galois connections in im- 
portant mathematical theorems, we discuss diverse contributions to a sys- 
tematical theory of adjunction and residuation, and we touch on various 
applications to topology, logic, universal algebra and formal concept anal- 
ysis. 

Mathematics Subject Classification: 01-02 , 06A15 , 06D15 , 12F10. 

Key words: Adjoint map , Axiality, Closure operator, Complete lattice, Ga- 
lois connection, Polynomial equation , Field, Group, (Partially) Ordered set, 
Polarity. 



l 

K. Denecke etal. (eds.), Galois Connections and Applications, 1 - 138 . 
© 2004 Kluwer Academic Publishers. 




2 



M. Erne 



Contents 

1 The Idea of Adjunction and Galois Connections 

1.1 Adjunctions between Ordered Sets 

1.2 Closure and Kernel Operations 

1.3 G,alois Connections 

1.4 Polarities and Axialities 

1.5 Formal Concept Analysis 

2 The Roots of Galois Connections 

2.1 Prelude on Symmetric Functions 

2.2 Joseph-Louis Lagrange: Permutation Groups, Partially 
Symmetric Functions and Interpolation 

2.3 Evariste Galois’ Memoir on the Solvability of Equations 

2.4 Richard Dedekind’s Lectures on Algebra 

2.5 Dedekind’s Order- and Lattice-Theoretical Concepts 

2.6 Ernst Schroder’s Algorithms and Calculi 

2.7 David Hilbert’s Nullstellensatz 

3 Galois Connections in the Twentieth Century 

3.1 From Dedekind to Noether, Artin and van der Waerden: 
The Modern Galois Theory 

3.2 Garrett Birkhoff’s Polarities 

3.3 Oystein Ore’s Galois Connexions 

3.4 Adjunctions as Galois Connections of Mixed Type 

4 Residuation Theory 

4.1 From Boole to Schroder: The Logical Calculus 

4.2 Dedekind’s Invention of Lattice-Ordered Groups 

4.3 Wolfgang Krull’s Ideal Domains and Quantales 

4.4 Morgan Ward’s and R. P. Dilworth’s Residuated Lattices 

4.5 T. S. Blyth and M. F. Janowitz: Residuation Theory 

4.6 Heyting Algebras, Locales and Pointfree Topology 




Adjunctions and Galois Connections 



3 



1 The Idea of Adjunctions and Galois Connections 

Mais je n’ai pas le temps, et mes idees ne sont pas encore 
bien developpees sur ce terrain, qui est immense. 

Evariste Galois, in the letter to his 
friend Auguste Chevalier, written in 
1832, on the night before the duel 
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Galois connections are the main theme of this book. So, readers not 
familiar with that topic might wish to know: where do they come 
from, where are they used, and who were the prominent pioneers of the 
theory? I hope I will be able to answer at least in part these questions. 
Certainly it cannot be the intention of a historical introduction to 
cover all relevant instances of Galois connections, and one or the other 
expert will miss his own favorite theme. What I have tried to do is to 
collect together a series of situations where Galois connections enter 
(or may be introduced into) the picture of an interesting classical 
theory or problem. A comprehensive historical discussion of the roots 
of order and lattice theory can be found in the book by Mehrtens 
[92]. For historically oriented treatments of classical Galois theory, 
two comprehensive sources are Edwards [39] and Tignol [138]. 

In the 19th century, the concept of Galois theory was not yet born, 
though we shall have occasion to encounter it implicitly in the work of 
the prominent pioneers of Galois theory: Lagrange, Abel, Galois and 
Dedekind. 

In the 20th century, milestones on the way to the modern view of Ga- 
lois connections are Birkholf ’s “polarities” , Ore’s “Galois connexions” , 
and Schmidt’s “Galois correspondences of mixed type”, nowadays bet- 
ter known as “adjunctions” . Particular mention should be made of the 
residuation theory initiated by Krull, Dilworth and Ward, continued 
by Blyth and Janowitz, and leading to the modern theory of “quan- 
tales”. We also briefly touch on the practice-oriented interpretation 
of relations and polarities as “contexts” in the sense of formal concept 
analysis (see R. Wille’s contribution “Dyadic Mathematics - Abstrac- 
tions from Logical Thought” in this volume). However, reasons of 
limited space forbid us to have a closer look at the immense field of 
categorical Galois connections and functorial adjunctions. 

A number of examples from order theory, universal algebra and topol- 
ogy, showing how the tool of adjoint maps may facilitate or clarify con- 
siderably mathematical problems, will accompany our journey through 
the world of Galois connections. The examples chosen are sometimes 
rather elementary, but nevertheless some of them have caused break- 
throughs not only in the development of mathematical theories, but 
also in their applications. 
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1.1 Adjunctions Between Ordered Sets 

In their most general sense, adjunctions and/or Galois connections 
make it possible to relate two “worlds” of (more or less mathemati- 
cal) objects with each other in order to gain information about one 
world by passing to the other, perhaps better known world (cf. [15]). 
Slightly more precisely, the passage should be made so that certain 
order structures are preserved or inverted. In a nutshell, two typical 
“microcosmic worlds” could be the left and the right hand side of an 
equality or inequality. The usual process of simplifying or evaluating 
such an (in)equality is to shift some “junk” from one side to the other 
and then to transform the lightened side suitably; then, if necessary, 
to shift parts back from the other side, and so forth. 

An old example for such a procedure is the following problem from 
Abu Kamil’s Algebra (about 880 A.D.): 

Under what conditions is it possible to solve the equation 

— x 2 + 2 bx + c = □ ? 

(Here □ stands for a square). The answer given by Abu Kamil is: 
If c < 0 then —c must be less than b 2 , and b 2 + c must be a sum of 
two squares. It is unlikely that Abu Kamil already was aware of the 
following subtle difference: while the second condition is necessary 
and sufficient for the existence of a solutions within the rationals, the 
first one is necessary and sufficient for the existence of a solution in the 
reals. The argument here is a typical case of shifting parts between the 
two sides of an inequality, adopting the fact that in the reals, squares 
are just positive numbers: 

— x 2 + 2 bx + c>0 Ox 2 — 2 bx <=>■ c + b 2 > x 2 — 2 bx + b 2 

c + b 2 > (x — b ) 2 =$■ c + b 2 > 0 —c<b 2 . 

Notice that the implication arrow may be inverted if x is close enough 
to b. 

Of course, the treatment of similar inequalities, requiring the possi- 
bility of adding or subtracting values on both sides, or of multiplying 
them simultaneously by positive numbers etc., relies on the properties 
of an ordered (semi)group or field. 

It is not difficult to make precise what is generally needed in order to 
shift parts of an (in) equality from one side to the other: the adequate 
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description of such situations is by means of adjoint pairs of maps 



P 




Q 



between (quasi-) ordered sets, subject to the defining condition 

(*=?) P < q{q) & Hp) < 9, 

which turns out to be equivalent to the requirement that A and g be 
isotone (i.e. order-preserving) maps satisfying the inequalities 

P < e(A(p)) and A (g(q)) < q. 

As usual, we mean by a quasi-ordered set one with a reflexive and 
transitive relation, called the order relation. If the relations involved 
in an adjunction are partial orders (i.e. antisymmetric), then the left 
or lower adjoint A is uniquely determined by its right or upper adjoint 
g, and conversely. Notice that rotating the above little diagram by 
180° leads to an adjoint pair (g, A) between the dual posets Q d and 
P d . In the introductory example, A could stand for the addition of 
a constant, and g for the subtraction of the same constant. More 
generally, in any (additively written) partially ordered group G with 
isotone translations 

r a : x i-* x + a 

the “subtractions” 

o a \yr+y- a 

are both left and right adjoint to r 0 , being inverse to r a . Generally, a 
poset with a binary operation * is said to be residuated iff each of the 
translations x ^ x * a and x i — > a * x has a right adjoint (see Chapter 

4). 

Notice that one always may take, as a special instance of a partial 
order, the equality relation =. In that specific case, an adjoint pair of 
maps is just a pair of mutually inverse bijections, and the usual treat- 
ment of equations by shifting parts of one side to the other appears 
as a special case of the general handling of inequalities. 

A similar, perhaps somewhat unexpected type of adjunction arises 
when the first quasi-order is an equivalence relation and the second is 
the identity relation. For example, the “world of congruences modulo 
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a prime number p” is translated into the “world of the Galois field 
F p = Z/pZ” (consisting of the residue classes modulo p), by means of 
the quotient map 

A : Z — y F p , n4n + pZ. 

Any left inverse g : F p — > Z of A , picking one representative from each 
residue class, is both left and right adjoint to A, since 

A (n) = q n = g(q) mod p. 

Thus, all statements, problems and solutions in Gauss’ theory of con- 
gruences modulo a prime number [61] may be translated into the the- 
ory of Galois fields F p , and vice versa. 

An obvious question in the present context is this: which maps be- 
tween ordered sets do have a left or a right adjoint? Although the 
answer is very simple, it seems not to have been stated explicitly be- 
fore the middle of the 20th century (see e.g. Pickert [114], Schmidt 
[123]). Calling sets of the form 

(p] = {x e P : x < p} 

principal ( order) ideals and sets of the form 
[p) = {x e P : p < x} 

principal dual ideals or principal filters, one has the following 

Characterization Theorem for Adjoint Maps 

(1) A map A : P — )■ Q between quasi-ordered sets has a right adjoint 
if and only if it is residuated, i. e. inverse images of principal ideals 
under A are again principal ideals. If P is a poset, the right adjoint 
g : Q — >■ P is then given by 

g(q) = max{p G P : A (p) < q) . 

(2) Dually, a map g : Q — »• P has a left adjoint if and only if it is 
residual, i. e. inverse images of principal dual ideals under g are again 
principal dual ideals. If Q is a poset, the left adjoint A : P — > Q is 
given by 

A (p) =min{5 eQ:p< g(q)}. 

(3) Any residuated map between posets preserves all existing joins, and 
any residual map all existing meets. 
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( 4 ) Conversely, any join (meet) preserving map between complete lat- 
tices is residuated (residual). 

Indeed, if a map A : P — » Q between complete lattices preserves joins 
then 

q‘-Q->p, \/{p e p ■ Mp) < q} 

is the right adjoint of A; and dually, if g : Q — >■ P preserves meets then 

a : P— >■ Q, p i-t A {q £ Q : p < q{q)} 

is the left adjoint of g. 



1.2 Closure and Kernel Operations 

Adjunctions are intimately related to the notion of closure and ker- 
nel operations. Set-theoretical closure and kernel operators are well- 
known from algebra, topology, geometry and many other mathemati- 
cal disciplines. The easiest order-theoretical abstraction is the follow- 
ing: a closure operation of a partially ordered (or quasi-ordered) set 
P is a map 7 : P — )■ P such that 

x < 7 (y) ^ l(x) < 7 (y)- 

In other words, closure operations are in one-to-one correspondence 
(via surjective corestriction) to left adjoints of inclusion maps between 
ordered sets. Kernel operations are defined dually; hence, their surjec- 
tive corestrictions are just the right adjoints of inclusion maps. Calling 
a subset C of a poset P a closure range if for each x € P there is a least 
x G C with x < x, one immediately verifies that 7 c : P — >■ P, x i->- x 
is a closure operation, and that the assignment C 1 — > 7c yields a dual 
isomorphism between the poset of all closure ranges in P (ordered by 
inclusion) and the poset of all closure operations of P (ordered point- 
wise). The closure ranges in a complete lattice are just the meet-closed 
subsets. Analogous statements hold for kernel operations and kernel 
ranges. 

On the other hand, an arbitrary adjunction (A ,g) always gives rise 
to a closure operation go\ = A g and a kernel operation A o g — gX, 
because of the inequalities p < g(X(p )) , A (g(g)) < q and the resulting 
equations 
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A gX = A, gXg = g . 

(In order to be flexible in due course, we use two notations, A g and 
go\, for the same composition of maps, given by 

pXg = (A g)(p) = goX{p) = g(X{p)) 

and meaning that one applies first X and then g.) 

Denoting the ranges of A and g by PX and Qg, respectively, one has the 
following fundamental result, in the covariant form with isotone maps 
probably formulated first by J. Schmidt [123], while the contravariant 
form with antitone maps is already to be found in Ore’s pioneering 
paper [38]: 

Adjunctions and Closure-Kernel Isomorphisms 

An arbitrary adjunction (A , g) between posets P and Q induces mu- 
tually inverse isomorphisms X : Qg — > PX and g : PA — » Qg. Further- 
more, the fixpoint set of the closure operation 7 = g oA is the closure 
range Qg, and the fixpoint set of the kernel operation k = Xo g is the 
kernel range PX. The original adjunction may be reconstructed from 
these data by X (p) = Xoj(p) and g(q) = goK(q). Under this construc- 
tion, the adjoint pairs of maps between P and Q are in one-to-one 
correspondence with isomorphisms between closure ranges of P and 
kernel ranges of Q. 




-v c ~ ^ • - 

A 

Adjoint pairs compose in the obvious manner: if (A, g) is an adjunction 
between P and Q, and (a, r) is an adjunction between Q and R, then 
(. Xa,rg ) = (ao X, gor) is one between P and R. The essence of the 
previously established correspondence may be formulated as a 

Homomorphism Theorem for Adjunctions 

Every adjunction between posets has a decomposition, unique up to 
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isomorphism , into a “closure adjunction” (constituted by a closure op- 
eration and an inclusion map), a “perfect adjunction” (constituted by 
two isomorphisms), and a “kernel adjunction ” (constituted by an in- 
clusion map and a kernel operation). In particular, every adjunction 
factorizes into an “epi- adjunction” (with surjective left adjoint) and a 
“mono- adjunction” (with injective left adjoint). 



1.3 Galois Connections 

What are Galois connections ? Many authors use that term as a syn- 
onym for what we called adjunctions. But we prefer the convention 
to distinguish between the “covariant” case of adjunctions and the 
“contravariant” case of Galois connections. Thus, inverting the order 
in the second ordered set Q, one arrives at the classical definition of 
Galois connections (alias Galois correspondences), where the defining 
equivalence becomes “symmetric”: 

v < q(q) o q< Hp)- 

A well-known equivalent definition of Galois connections, due to Ore 
(see [38] and Section 3.3) requires that A and g be antitone (i.e. order- 
inverting) maps satisfying 

P < q{Hp)) and q < X(g(q)). 

By definition, both compositions of the partners of a Galois connec- 
tion are closure operations, and their ranges are dually isomorphic. 
Moreover, by the above correspondence between adjoint pairs and 
hull- kernel isomorphisms, 

every pair of mutually inverse dual isomorphisms between 
closure ranges is induced by exactly one Galois connection 
between the original ordered sets. 

While adjoint maps are isotone (order-preserving), the partners of a 
Galois connections are always antitone (order-reversing). Certainly, 
an advantage of adjunctions, compared with Galois connections, is 
that they compose in a natural way. However, there is also a (less 
natural) way of composing Galois connections (see Picado [113]). The 
fixpoints on both sides of a Galois connection are said to be Galois 
closed (while in an adjunction (A, g), the fixed points of Ao g are said to 
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be Galois open, because Xog is a kernel operation). The fundamental 
problem of Galois connections is to determine or characterize (in some 
practical way) the Galois-closed objects. In concrete situations, this 
is often a difficult task (as we know from classical Galois theory). A 
rather easy example is 

The Galois Connection Between Subsets and Isotone Self- 
maps 

For any subset Y of a complete lattice L, the map 
A (T) = Ay i L — ^ L, x i — ^ f\{y £ Y i x ^ ?/} 
is a closure operation. 

On the other hand, for any isotone self-map 7 of L, the set 
g( 7) = {x E L : 7(2;) < x} 

is meet-closed, hence a closure range. The pair (X,g) is a Galois 
connection between the power set of L and the pointwise ordered com- 
plete lattice of all isotone self-maps of L. The Galois-closed subsets 
are precisely the closure ranges, and the Galois-closed self-maps are 
the closure operations. Thus, the above Galois connection induces the 
known dual isomorphism between the complete lattice of closure oper- 
ations and that of closure ranges. 

Indeed, an isotone map 7 : L — >■ L is Galois closed iff A (2/(7)) < 7, 
which means that /\{a E L : x < a and 7(a) < a} < j(x) for all x E 
L. Clearly, this implies x < 7(2;), and if x < 7 (y) then y < a and 
7 (a) < a entail j(x) < 7(7(2/)) < 7(7(0)) < 7 (a) < a, whence 

j(x) < f \ {a E L : y < a and 7(0) < 0} < 7(2/). 

Thus, 7 is a closure operation. Conversely, if the latter is assumed 
then for x E L, we have x < 'j(x) and 7(7(2;)) < 7(2;), hence 

A {a £ L : x < a and 7(0) < a} < j(x). 

That the Galois-closed subsets are just the meet-closed ones is easily 
checked. Specifically, the Galois-closed subsets of power sets (under 
the Galois connection between subsets of the power sets and inclu- 
sion preserving maps) are just the closure systems, i.e. those systems 
which are closed under arbitrary intersections; and the Galois-closed 
inclusion-preserving maps are precisely the closure operators. 

Many authors speak of a perfect adjunction or a perfect Galois connec- 
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tion if both partners are surjective (respectively, all involved elements 
are Galois closed). Helpful tools in the determination of perfect ad- 
junctions or Galois connections are often certain scales ; most gener- 
ally, these are merely functions p on a poset P so that p < p' implies 
p{p ) ^ p(p'). In concrete cases, such a scale may be a cardinality, 
dimension or degree function, etc. Clearly, every injection is a scale, 
but not conversely. 

Epi- Adjunctions and Right Perfect Galois Connections 

For an adjunction or Galois connection (A ,g) between quasi-ordered 
sets P and Q, the following conditions are equivalent: 

(a) Xog = idQ. 

(b) A op is injective. 

(c) Xog is surjective. 

(d) g is injective. 

(e) A is surjective. 

(f) There is a map p : P — >• N such that pog is a scale. 

(g) There is a scale p : Q — >• N such that p = poXog. If Q is partially 

ordered , (g) may be replaced with 

(h) There is an isotone scale p : Q — >■ N such that p < po Xog. 

The implications (a)=^(b)=^(d)=^(h)=^(g)=>(f) and (a) =^(c) =^(e) 
are obvious. For (f)=>-(a), note that Ao g(q) < q and po goXo g(q) = 
ppg(q), hence Xog(q) = q. Similarly, (e) implies (a) because XogoX = X. 

Combining that result with its dual (exchanging left and right), one 
obtains the 

Theorem on Perfect Galois Connections 

For an adjunction or Galois connection (A ,g) between quasi-ordered 
sets P and Q, the following conditions are equivalent: 

(a) A and g are mutually inverse bijections. 

(b) Xog and goX are injective. 

(c) Xog and goX are surjective. 

(d) A and g are injective. 

(e) A and g are surjective, (f) There are maps p:P—>N, p:Q—>N 
such that poX and pog are scales. 

(g) There are scales p :P—> N, p :Q N with p = poX, p = pog. 
If P and Q are partially ordered, (g) may be replaced with 
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(h) There are isotone scales pb:P—>N, p:Q^N with fi<po\ 
and p < /aog. 




For (a) =>■ (g) and (h), take 

At : P ->■ PxQ, P (p,X(p)) and p : Q ->• PxQ, q i-> (g(q),q). 

Note that two maps A and g form a perfect adjunction iff g is both 
a left and a right adjoint of A, which means that they are mutually 
inverse isomorphisms. Thus, a perfect Galois connection consists of 
two mutually inverse dual isomorphisms. 



1.4 Polarities and Axialities 

A fundamental fact concerning Galois connections, pointed out in the 
early sources by Birkhoff [3], Everett [54] and Ore [38], is that all 
polarities in the sense of Birkhoff, that is, all Galois connections be- 
tween power sets, may be constructed in a unique way from relations 
between the underlying sets. The partners of the induced Galois con- 
nection associate with any subset of the one set the collection of all 
elements of the other that are in relation to each element of the former 
subset. We shall focus on that topic in Sections 3.2 and 3.3, but for 
easy reference, we sum up already now the basic connections between 
relations, polarities und dualities of closure systems. 

Relations and Polarities 

Every relation R between two sets A and B gives rise to a Galois 
connection (R~^,^R) between the power sets of A and B, by sending 
X C A to the set R~^(X) = X R = {y £ B : xRy for all x € A}, 

Y C B to the set *~R(Y) = Yr = {x £ A : xRy for all y £ Y}. 

The ranges of these two maps are dually isomorphic closure systems. 

Conversely, any Galois connection between power sets and, conse- 
quently, any dual isomorphism between two closure systems comes 
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from a unique relation R between the underlying sets in that manner, 
where xRy means that y belongs to the image of the closure of {x} 
under $. 

For example, this construction provides a natural framework for the 
Fundamental Theorem of Galois Theory, where the relation in question 
is the “fix relation” between automorphisms and elements of a field: 

aRy O- a{y) = y. 

One map of the induced Galois connection sends each subset of the 
automorphism group to the subfield of all elements fixed by these au- 
tomorphisms; and in the opposite direction, the other map sends each 
subset to the group of all automorphisms fixing the elements of that 
set. Each Galois-closed subset of the automorphism group is then 
a subgroup, and each Galois-closed subset of the field is a subfield. 
The important (and more difficult) part of the Fundamental Theo- 
rem of Galois Theory states that in the case of a Galois extension, 
this restriction yields a dual isomorphism between the closure system 
of all intermediate fields of the extension and the closure system of 
all subgroups of the corresponding automorphism group (the “Galois 
group”). However, that is not the way Galois himself attacked the 
solvability of polynomial equations. We shall come back to that clas- 
sical theme when we discuss Galois’ famous memoir [56, 57] in Section 
2.3. 

Of course, there is also a covariant counterpart of polarities, obtained 
by composing the polarities with dual isomorphisms via set comple- 
mentation. Although such axialities , i.e. adjunctions between power 
sets, are quite natural, they seem to be less common than polari- 
ties. Let us summarize the main facts about such adjunctions be- 
tween power sets and the inducing relations (see the Primer on Galois 
Connections [15] for details). 

Relations and Axialities 

Every relation R between two sets A and B gives rise to an adjunction 
(~^R, R^) between the power sets of A and B, by sending X C- A to 
the set TR(X) = {y G B : xRy for some x e X}, Y C B to the set 
R i ~(Y) = {x EA : xRy implies y G Y}. 

Hence, the range of ~*R is a kernel system, the range of R*~ is a 
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closure system, and the two systems are isomorphic via restriction 
and corestriction of these maps. 

Conversely, any adjunction between power sets and, consequently, any 
isomorphism $ between a closure system and a kernel system comes 
from a unique relation R between the underlying sets in that manner, 
where xRy means that y belongs to the image of the closure of {x} 
under $ . 

Polarities and axialities are related via the complement operator c by 
the equations 

R(X) C = R C ^(X), R^(Y C ) = <- R C (V ). 

Notice that if F is a function rather than a general relation, then 
the above sets ~*F(X) and F < ~(Y) denote the usual image and preim- 
age sets, respectively. Thus, in particular, any function F : A — >■ B 
gives rise to an adjoint pair of “lifted maps” ~^F and F between 
the corresponding power sets. As usual, we write F(X) for ~*F(X) if 
no confusion is likely to arise, but we avoid the ambiguous notation 
F -1 (F) for F*~(Y). The adjointness condition is here the well-known 
equivalence 

F(X) C Y <£> X C F<~(Y). 

For more examples not mentioned in this survey, see [15]. 

The aforementioned correspondence between relations and axialities 
may be represented in the form of a further perfect adjunction: 

Isomorphism Theorem for Axialities 

Assigning to each relation R the map ~^R yields an isomorphism be- 
tween the power set of Ax A (i.e. the set of all relations on A) and the 
complete lattice of all residuated maps on the power set of A. More- 
over, this is also a semigroup isomorphism, sending the composition 
product of relations to the composition product of mappings. 

In order to demonstrate how polarities and axialities may simplify ar- 
guments and computations, let us conclude this section with a typical 
class of examples arising frequently in elementary linear algebra. 

Let D be a division ring (skew field) and D” the space of raxn-matrices 
over D. Note that by the associative and distributive laws for matrix 
operations, D™ is a ring (with unit), is a left D^-module, and 
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D™ is a right D^-module. Denote by £(D”) the closure system of all 
left D™-submodules of and by 7l(D™) the closure system of all 
right D™-submodules of D™. 

We now consider, for any three natural numbers k, m, n, the orthogo- 
nality relation ±=^_L m between A = D % and B = D™ given by 

x _L y xy = 0. 

Again by the associative and distributive laws, it is clear that for 
any subset X of A, the Galois-closed set X 1 belongs to 7 Z(D™), and 
on the other hand, for any subset Y of B , the Galois-closed set Y± 
belongs to £(D%). The point is now that, conversely, every submodule 
S G £{D%) (and every T G 7Z(D™)) arises in that fashion. For the 
proof, one has to find for any x G A \ S a matrix y G B such that 
sy = 0 for all s G S but xy 0. Since S' is a left D|-module, it is easy 
to see that the rows of all matrices in S form a subspace U of the row 
space D", and that S consists precisely of those matrices whose rows 
belong to U. Thus, x S means that at least one row Xj of x is not 
in U. Therefore, we find a basis {bi, ..., b n } of D” so that b\ = Xj and 
{b 2 , ...br} is a basis for U. Solving the system of linear equations 

£r=i%* = (i,o,...,o) , 

we get a column y G D * with Xjij = biy = 1 and biy = 0 for i > 1, 
hence uy = 0 for all u G U. Consequently, the matrix y = (y, ...,y) G 
D ™ has the required properties xy ^ 0 and sy = 0 for all s G S. 

Thus, we have established the 

Dual Isomorphism Theorem for Matrix Submodules 

For any division ring D and natural numbers k, m, n, the polarity 
associated with the orthogonality relation between D% and D™ induces 
a dual isomorphism between the left submodule lattice C(D%) and the 
right submodule lattice In particular, for fixed n, each of the 

lattices C(D%) is isomorphic to the row subspace lattice and 

each of the lattices 7 Z{D™) is isomorphic to the column subspace lattice 

noi). 

Notice that, specifically, £(D”) is the lattice of all left ideals in the 
matrix ring and 7Z{D r fi) is that of all right ideals. 

In the present context, not only polarities, but also certain axialities 
are of particular use. Consider the “column relation ” C between D% 
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and D * given by xCy iff y belongs to the column space of x, i.e. 
xb — y for some b G D\. 

Isomorphism Theorem for Ideals of Matrix Rings 

The axiality associated with the column relation between D ” and D * 
induces an isomorphism between the left ideal lattice £(£)”) and the left 
subspace lattice C{D]f). Moreover, every left ideal of D™ is principal. 

However, it should be emphasized that not all Galois-open subsets of 
D * with respect to that axiality are subspaces, but the Galois-open 
sets are the unions of subspaces, hence form an Alexandroff topology 
(closed under arbitrary unions and intersections). The isomorphism 
theorem is an immediate consequence of the previous remarks concern- 
ing submodules, i.e. ideals of D", and the associated column spaces. 

Summing up the previous definitions and facts, we have four orthog- 
onality relations 



T = 


i-LrC DfxDi 


with 


orthogonal spaces X x and Y±_ 


K = 




with 


right kernel spaces X K 


L = 




with 


left kernel spaces Y L 


N = 


n ± n C D%X D™ 


with 


annihilator ideals X N and Y n 



Furthermore, we have the column relation 
C C D™ x D „ with column spaces ~^C(X) 

and the row relation 

R C D" x D™ with row spaces ~^R d (Y) 

where R d denotes the dual of the relation R. These six relations are 
connected via the relation product 

RS = {{x,z) : xRy and ySz for some y} 

as indicated in the following commutative diagram: 
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Thus, NC = K, RN = L, LC =_L, RK =±, and so RNC =_L . 

Lifting the relations to the associated polarities, respectively, axialities 
and suitable restriction yields the following commutative diagram of 
isomorphisms (perpendicular) and dual isomorphisms (horizontal or 
diagonal) : 




These isomorphisms are the starting point in John von Neumann’s in- 
genious coordinatization theory for complete, complemented modular 
lattices [145] . 



1.5 Formal Concept Analysis 

This modern, practice-oriented and mathematically founded theory 
has been initiated and propagated by Rudolf Wille and his coworkers 
since the seventies of the last century. Comprehensive sources about 
that theme are the books [58] and [59], the latter containing also a 
long list of references to further research on the subject. 

Formal concept analysis combines the mathematical ideas of polarities 
and lattice theory with philosophical concepts (in particular, that of 
intent and extent), with methods of ordering and classification, and 
with diverse algorithmic tools for applications, not only in mathe- 
matical but also in various extra-mathematical areas, from technical 
engineerings, biology, medicine, history and social sciences up to fine 
arts and music. Some of the prominent aims and topics formulated by 
Wille in [153] are: 

• Hierarchical classification of objects 

• Minimal systems of implications between sets of attributes 

• Combination of individual concept systems 
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• Reduction to basic concepts 

• Conceptual scaling and measuring 

• Structures for the representation of knowledge data 

But Wille also views formal concept analysis as an important tool for 
the purpose of “restructuring lattice theory". In the introduction to 
his inspiring article [153], he writes: 

... we go back to the origin of the lattice concept in nineteenth- 
century attempts to formalize logic , where the study of con- 
cepts played a central role (cf. Schroder [131]). Tradi- 
tional philosophy considers a concept to be determined by 
its “ extent ” [extension] and its “intent” [intension, com- 
prehension]: the extent consists of all objects (or entities) 
belonging to the concept while the intent is the multitude 
of all attributes (or properties) valid for all those objects. 

Let us explain briefly the mathematical framework for these ideas 
and the fundamental notions of formal concept analysis. For more 
background, see [59] and [6, 48, 49]. 

Mathematically, a (formal) context is a triple K = ( J, M, I), consisting 
of a set J (of “subjects” or “objects”), a set M (of “marks” or “at- 
tributes”) and an ( “incidence”) relation I C J x M. The concepts are 
the pairs ( X , Y) with X = 1} C J (the “extent”) and Y = X 1 C M 
(the “intent”). Ordered by “specialization” , i.e. inclusion of the first 
component or, equivalently, by dual inclusion of the second compo- 
nent, they form a complete lattice, the so-called concept lattice BK. 
Obviously, this lattice is isomorphic to the closure system of all ex- 
tents (via projection onto the first component) and dually isomorphic 
to that of all intents (via projection onto the second component). The 
resulting dual isomorphism between these two closure systems is just 
that induced by the polarity associated with the relation I. If the 
maps 

7 : J— BK, j h* ({jYj, {j} 1 ) and pi : M-+BK, m ^ ({ra}j, {m}/) 

are injective, one speaks of a purified context. Specific purified contexts 
are the basic contexts or L-contexts , where J is a join-dense subset of 
a complete lattice L (i.e., every element of L is a join of elements 
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in J), M is a meet-dense subset of L (defined dually), and / is the 
relation between J and M induced by the order relation of L. As we 
saw before, a context is purified iff it is isomorphic to a basic context. 
Moreover, every context K has a canonical “purification” Ko , obtained 
by passing from j to 7 (j), from m to p(m), and from I to the relation 
induced by the order of £?K. By definition, Ko is a basic context such 
that K and Ko have isomorphic concept lattices. Indeed, the concept 
lattice of any basic context is isomorphic to the original lattice L, by 
sending x 6 L to the concept 

( Jfl \,x, Mn t x ) = ({ j € J : j < x}, {m £ M : x < m }) . 

More precisely, one has the 

Fundamental Theorem of Formal Concept Analysis 

An arbitrary complete lattice L is isomorphic to the concept lattice of a 
context K = ( J, M, I) iff there is a join-dense embedding 7 : J — >■ L and 
a meet-dense embedding ft : M — >■ L such that jlm 7 (j) < jx{m ) 

in L. 

This theorem extends the well-known characterization of the Dedekind- 
MacNeille completion or completion by cuts of an arbitrary poset as 
the complete lattice (unique up to isomorphism) in which the poset is 
join- and meet-densely embedded (see [3, 28, 91] and Section 3.2). 

The interplay between contexts and complete lattices is very fruitful in 
both directions: for a given context, a careful inspection of the concept 
lattice provides insight into hierarchies, implications and independen- 
cies that wouldn’t be obvious from large context tables. On the other 
hand, at least every finite lattice and, more generally, every doubly 
based lattice, that is, every complete lattice L whose \/-irreducible el- 
ements form a join-dense subset J and whose /\-irreducible elements 
form a meet-dense subset M, is coded entirely by its basic context 
( J, M, I). The latter may be regarded as a “logarithm” or a “base” of 
the whole lattice, usually having much smaller size than L. As pointed 
out in [49] and in [52], the passage from a context to its concept lattice 
and, in the reverse direction, from a complete lattice to the context 
of irreducible elements, provides a functorial adjunction between the 
category of complete lattices with complete homomorphisms and the 
category of contexts with so-called conceptual morphisms. Restriction 
to doubly based lattices on the one hand and to purified contexts on 
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the other hand even yields a categorical equivalence. 

A useful generalization are the so-called multi-valued contexts , where 
the incidence relation is substituted by a map l : J x M -* N. Of 
course, this includes the usual notion of contexts, by taking the two- 
element set {0, 1} for A” and replacing relations by their characteristic 
functions. But the multi-valued approach is much more flexible and 
has manifold modern applications, for example to fuzzy concept analy- 
sis and fuzzy logic [20, 21, 140], relational data bases [23], information 
systems [111, 112], and Chu spaces [115]. 

Some hundreds of further references to themes involving adjunctions 
and Galois connections may be found via the MathSciNet using the 
relevant key words. 



2 The Roots of Galois Connections 

It is almost impossible to trace back the idea of adjunctions and Galois 
connections to its historical origins. Actually, as a mathematical con- 
cept they probably did not emerge before the 20th century. However, 
they are already involved, more or less implicitly, in certain construc- 
tions and computations of earlier centuries. Of course, the examples 
selected below are far from being representative for all such occur- 
rences in ancient mathematical contexts. Nevertheless, they show how 
certain considerations are made easier or at least more transparent by 
the language of adjunctions or Galois connections. 

We shall start with the theory of symmetric functions, initiated by 
Girard, Newton, Waring and Vandermonde. Then we touch upon La- 
grange’s ingenious contributions to the solution of algebraic equations 
and, of course, we discuss Galois’ pioneering work on the same theme. 
Among the historical background literature on the theory of polyno- 
mial equations, we recommend the recent book by J.-P. Tignol [138], 
which combines historical thoroughness with mathematical accuracy. 

Then we pass to the field- and ring-theoretical ideas of Dedekind de- 
veloped in his early Lectures on Algebra [26], and we have a look at 
Hilbert’s “Nullstellensatz” from the perspective of Galois connections. 
Other interesting aspects are Schroder’s contributions to universal al- 
gebra in the light of Galois connections and Dedekind’s early inves- 
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tigations on closure operators, lattices ( “Dualgruppen” ) and lattice- 
ordered groups (to be discussed later in Section 4.2). 

2.1 Prelude on Symmetric Functions 

An equation consists of several terms, some known and some 
unknown, some of which are together equal to the rest; or 
rather, all of which together are equal to nothing; for this is 
often the best form to consider. 

Translated from: Rene Descartes, La Geometrie 

Historically, the starting point for Galois theory, and hence for the 
appearance of Galois connections, were investigations concerning the 
solvability of polynomial equations. For Galois and Lagrange, a crucial 
tool in that undertaking were so-called symmetric (rational) functions 
and, in particular, symmetric polynomials. A polynomial or rational 
function in several variables is symmetric iff it remains fixed under 
the exchange of any two variables, and consequently, under arbitrary 
permutations of all variables. A rational function that is invariant 
only under a certain subset T of permutations may be called partially 
symmetric (with respect to T ). On the other hand, for a given rational 
function /, those permutations of the variables that leave / unchanged 
form a group, the isotropy group or stabilizer 1(f). 

By its very definition, the notion of (partially) symmetric functions 
comes from an elementary Galois connection: the “fix relation ” 

0 H / «■ D(f) = ! 

gives rise to a a polarity in Birkhoff’s sense (cf. Sections 1.4 and 
3.2) between S(X), the full permutation group of X = {aq, ..., x n } 
and K(X), the field of rational functions over a ground field K. This 
polarity associates with any set T C S(X) of permutations the subfield 
T H of all rational functions in K(X) fixed by each 0 G T, and in 
the opposite direction with any set F of rational functions the group 
F-\ = P| {1(f) : f € F} of all permutations fixing each / e F. In 
particular, S(X )h is the field Ks(X) of all symmetric functions. 

The following easy result, mentioned in Dedekind’s Lectures on Al- 
gebra [26] (see Section 2.4), may be regarded as a forerunner of the 
Fundamental Theorem of Galois Theory. 
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Theorem on Partially Symmetric Functions 

The assignment f i->- 1(f) is a surjection from K(X) (and even from 
the polynomial ring K[X ]) onto the subgroup lattice of S(X). Hence, 
the Galois connection associated with the fix relation induces a dual 
isomorphism between the subgroup lattice of the symmetric group S(X) 
and the closure system formed by the “fields of partially symmetric 
functions’’ , i.e. those subfields of K(X) that consist of all rational 
functions fixed by the members of a subgroup of S(X). 

Indeed, given a subgroup G of S(X), one easily finds polynomials 
/ G K[X] that are fixed precisely by the members of G , for example 

f( x u x n) = EUc EL i °{ x k) k ■ 



Now to the history of symmetric functions. The elementary symmetric 
functions in the variables x\, ... ,x n , 

Sk = Sk,n = £!<*<...<*<„ n m= i x im = Eycx,#y=fc Flyey V j 
are, up to signs, the coefficients of the polynomial 

a result probably due to Albert Girard (1629) in its full generality 
(see [64]), while specific cases for small degrees were already known to 
Francois Viete (Francesco Vieta; see [143]). 



Historians attribute the first major results on symmetric polynomials 
to Isaac Newton (see, for example, [39]). About 1666, Newton listed in 
his papers a whole sequence of formulas relating symmetric functions 
of the roots of a cubic polynomial with its coefficients. In these notes, 
one also finds the case n = 8 of the famous formula 

Ej — i ( 1)^' Cj,n Sk- j,n T hSk,n 0 

connecting the power sums 
a j,n = Ei=l ^ 

with the elementary symmetric functions s^n- This formula occurs, in 
a slightly different form and without proof, in Newton’s Arithmetica 
Universalis from 1707. It enables one to compute recursively the 
power sums from the elementary symmetric functions, and vice versa, 
by the earlier mentioned process of shifting parts of the equation from 
one side to the other. At least for n < 4, the formula was already 
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known to Girard [64], 

As a consequence of Girard’s equation (*), expressing the coefficients 
of a polynomial as symmetric functions of the roots, one has the follow- 
ing formulas providing the passage between the elementary symmetric 
functions in n and in n — 1 variables, respectively: 

(T) Sk,n = Sk,n — 1 T ^k— l,n — 1 %n j 

( — ) s k,n-l = ^2j—o s k-j,n{~ x nY • 

Although Newton wrote down only a list of specific formulas involving 
symmetric polynomials, it is assumed today that he was aware of the 
general 

Fundamental Theorem on Symmetric Polynomials 

Every symmetric polynomial may be expressed uniquely as a polyno- 
mial in the elementary symmetric functions. 

Mathematicians of the 18th century used that theorem without any 
inhibition. Lagrange qualifies it as “self-evident” but mentions that 
a “rigorous demonstration depends on the principles ” developed in 
his memoir on the solvability of algebraic equations [85]. Construc- 
tive proofs (by recursion) of the existence part appeared at the same 
time in Waring’s Meditationes Algebraicae [152] and in Vandermonde’s 
Memoire sur la resolution des equations [141], while the uniqueness 
part seems to have been neglected before Gauss’ demonstration in 
[60]. The first complete and rigorous proof of both parts and a correct 
argument for the extension to rational functions is probably due to 
Dedekind [26] (see Section 2.4). Today, we find many different proofs 
in the textbooks, some of them relying on parts of the more general 
Galois theory. But the historical order was just the converse, starting 
with Girard’s formula and culminating in Lagrange’s and Galois’ the- 
ory of polynomial equations, based on the Fundamental Theorem on 
Symmetric Polynomials. 

For the sake of convenience and easy reference, we add here two slightly 
stronger order-theoretical results in terms of rings; the first one ex- 
tends Waring’s method of difference exponents [152], and the second 
imitates Dedekind’s reduction method from n to n — 1 variables [26]. 
Of course, the ring-theoretical terminology is borrowed from the math- 
ematics of the 20th century, but it could be replaced easily with (less 
definite) phrases in the language of Waring, Lagrange and Galois. 
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Let R be an integral domain, that is, a commutative ring with 1 and 
no zero divisors. We denote by i?[X] the ring of all polynomials over 
R in a set X of variables; if X = {x}, we put, as usual, R[x] = R[X}. 
The symbol i?s[Af] will denote the subring consisting of all symmetric 
polynomials (fixed by all permutations of X). Mathematicians are 
familiar with the representation of polynomials as linear combinations 
of monomials. But what is a monomial? The easiest, though rather 
abstract definition uses the additive monoid of all functions ip 
from X to co = {0,1,2,...} with finite support {x E X : tp(x) ^ 0} 
and identifies monomials as functions from u into R having value 1 
for one argument m E and value 0 for all others. Denoting that 
monomial by x m , the product of two monomials is then given by “the 
exponential law” x m x m ' = x m+m ' . 

Starting with 1 instead of 0, we mean by cu n the set of all finite se- 
quences of natural numbers, and by a;” the subset of 

all decreasing sequences among them. These sets are ordered lexico- 
graphically in the usual way (from left to right). That lexicographical 
ordering is a well-ordering, so it may be used for induction proofs as 
in the case of natural numbers. The “reverse sum operator” 

S :oo n ->LO™, (mi,...,m n )y-+(m 1 +... + m n ,...,m n -i+m n ,m n ) 

and the “reverse difference operator” 

A : u™ -»u/\ (mi, ...,m n ) (m 1 -m 2 , ..., m„_i- m n , m n ) 

induce mutually inverse bijections between oj n and u>". (However, S 
and A become isomorphisms, hence form a perfect adjunction only if 
co n and are ordered anti-lexicographically from right to left). For a 
finite set 



one has a canonical isomorphism between the additive monoid ui n 
and the multiplicative monoid of all monomials over X n , sending 
m = (mi,...,m n ) to x m = {{a;™ 4 . Now, the (multi-) degree fi(f) of a 
non-zero polynomial / E R[X n \ is defined to be the lexicographically 
largest “exponent” m of monomials occurring in / (supplemented by 
the setting //(0) = — oo). Thus, one may write / in the form 

/ = A (f)x^ +r 

where A (/) is the “leading coefficient ”, is the “leading mono- 
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mial”, and r is the “remainder polynomial” having smaller degree 
than /. A polynomial with leading coefficient 1 is said to be monic. 
(Note that our definition requires monomials to be monic.) 

There is a well-defined substitution homomorphism 

<r : -R[A n ] Y .RgjA^], f (x\i ..., x n ) i Y f (si >n , ..., 

Now, the Fundamental Theorem on Symmetric Polynomials amounts 
to a 

Perfect Adjunction Between Polynomials and Symmetric Poly- 
nomials 

The substitution homomorphism a induces an algebraic and order- 
theoretical isomorphism between the rings R[X n \ and Rs[X n ], ordered 
by divisibility. 

The inverse isomorphism 8 : i?,s'[A" n ] — >■ R[X n ] may be defined recur- 
sively by 5(0) = 0 and 

5(s) = A(s)a;^ s ^ + 8(s) otherwise , where 

fi(s) = A (p(s)), s — s A(s)a(a: A(s) ) e Rs[X n ], fi(s) < p(s). 

Furthermore, // and ft satisfy the degree equations 

Vif 9) = t*(f) + M and fi{st) = fi(s) + fi{t). 

In particular, they induce isotone scales on the partially ordered sets 
of all monic polynomials and of all symmetric monic polynomials, re- 
spectively. 

Proof. The first of the latter two equations is an easy consequence of 
the (distributive) product definition for polynomials, and the second 
follows from the first by the fact that A preserves sums. Note that 
the degree /i,(s) of a symmetric polynomial s is in fact decreasing, by 
definition of the lexicographical order. Hence, // and ft are scales pre- 
serving the divisibility relation. Iterated application of the equation 

p(fg) = p(f)+M y ie i ds 

p(a(x m ))= p(U S jt n m i) =E m i/ i ( S J>) = T,(m). 

As £ is inverse to A, we conclude that 
p,{a{x^)) = E(j5(s)) = E(A(//(s))) = p{s). 

From this, we infer the inequality 
p{s) = fi(s - A (s)a(x^ s) )) < fi(s ) , 
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which makes the recursive definition of 5 well-founded. 

Now, it is easy to prove inductively the equation er(5(s)) = s for all 
symmetric polynomials s E ifelW], starting with a (5(0)) = 0: 

<j(<5(s)) = A(s)a(x^) + a(S(s — X(s)a(x^))) = s. 

As E is injective, so is p o a on monomials. Hence, for any non-zero 
polynomial / = ^ a m x m , in the linear combination 

a (f) = Y, a mv(x m ) 

any two summands have distinct leading monomials. The largest of 
them is also the leading monomial of cr(f), whence a(f) is not zero. 
This proves injectivity of a and shows that <5 is inverse to a. Since the 
latter is an algebraic isomorphism, so is the former, and a(f) divides 
cr(h) iff / divides h. □ 

Though p(a(x m )) = E(m) entails jl(a(x m )) = A(n(a(x m )) = m = 
ix(x m ), the diagram below unfortunately does not commute! 




LV n U { — oo} 

For example, if t = X\X 2 + X\X 2 and s = a(t) then /r(s) = X\X 2 , 
hence 

/x(a(t)) = jl(s) = X 1 X 2 xi 2 x 2 = n(t) = n(5{s)). 

We now formulate the second extension of the Fundamental Theorem 
on Symmetric Polynomials. 

Characterization Theorem for Rings of Symmetric Polyno- 
mials 

A sequence of rings R n satisfies the conditions 

(0) Rq = R, (1) Rn C R s [X n ], (2) Rn-i C R n [x n \, and 

(3) fn(% n ) = 0 for some monic polynomials f n e R n [y\ of degree n 

if and only if R n = Rs[X n ] for all n. 

We prove the “only if” part by induction (the “if” part is easier). 

By (0), we have R 0 = R = i?s[0] = Rs[-^o]- Assume R n ~i = i? 5 [X n _i] 
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and consider any g G.Rs[Af n ]. Expanding g according to powers of x n 
yields coefficients that are symmetric in X\, ...,x n -\. Thus, 

(4) R s [X n ] C RsiX^ixn], 

and by the induction hypothesis, g belongs to R n -i[x n }. Now, we infer 
from (2) that there is a polynomial h G R n [y\ of minimal degree such 
that h(x n ) = g. This degree must be less than n, because otherwise 
subtraction of a suitable multiple of the polynomial f n in (3) would 
reduce the degree. But by symmetry, the polynomial h—g G Rs[X n \[y] 
has the distinct roots xi, ...,x n , and it would have the same degree as 
h (note that g is here a constant) unless h — g is the zero polynomial. 
Thus, g = h belongs to R n . □ 

The Fundamental Theorem on Symmetric Polynomials now follows 
immediately by taking for R n the ring R[si jn , s n , n ] generated by 
the elementary symmetric functions: (0) and (1) hold by definition, 
the recursion (— ) ensures (2), and the polynomial f n = n fe=i ~ x k) 
satisfies (3). 

It is not difficult (cf. Section 2.4) to derive from the Fundamental 
Theorem on Symmetric Polynomials the 

Fundamental Theorem on Symmetric Functions 

A rational function in n variables over a field is symmetric if and 
only if it has a (unique) representation as a rational function in the 
elementary symmetric functions. 

Another consequence of the Characterization Theorem is 

Corollary 1. Any subset Y of the set X = X n of variables satisfies 
the equation 

B s [x\y][y] = -R s ray]. 

For the proof, observe first that (2) for R n = Rs[X n ] together with 
(4) gives 

Rs[X n - 1 ] [x n ] = Rs[X n ] [x n ]- 

The general formula is then obtained by induction. 

Finally, substituting for the indeterminates the (pairwise distinct) 
roots of a polynomial, one arrives at the useful 

Corollary 2. If a polynomial f G R[x] has the set Z of roots, then 




Adjunctions and Galois Connections 



29 



Rs[Z\Y ] [T] = i?[T] for all Y C Z , in particular 
Rs[Z] = R and Rs[Z\ {z}] [z\ = R[z] for all z £ Z. 

The last two formulas say that any symmetric polynomial (or rational) 
expression in the roots of a given polynomial must already belong to 
the ground ring (or field, respectively), and that a polynomial (or 
rational function) that is symmetric in all but one root is actually 
a polynomial (or rational function) in that root. This is the form 
in which Galois and his predecessor Lagrange used the Fundamental 
Theorem on Symmetric Functions. 



2.2 Joseph-Louis Lagrange: Permutation Groups, 
Partially Symmetric Functions and Interpolation 

Voila, si je ne me trompe, les vrais principes de la resolution 
des equations et V analyse la plus propre a y conduire; tout 
se reduit, comme on voit, a une espece de calcul de combi- 
naisons, par lequel on trouve a priori les resultats auxquels on 
doit s’attendre. 

J. - L. Lagrange in Article 109 of his memoir 
on the algebraic solution of equations (1771) 

About a century after Newton, it was Lagrange who developed a rather 
comprehensive theory of partially symmetric functions. Lagrange’s 
immense mathematical work [85] is one of the most impressive cre- 
ations of human kind. His contributions to the theory of polynomial 
equations [86] and its basic group-theoretical aspects alone made him 
a pioneer in the fundaments of Galois theory. Though not using ex- 
plicitly the abstract notion of a group, Lagrange systematically dealt 
with 

• permutation groups of the roots of a polynomial equation and 

• the change of rational functions under permutations of the vari- 
ables. 

These are the essential tools not only in Lagrange’s own considera- 
tions on the solvability of polynomial equations, but also in the later 
expansion of the theory by Abel, Galois and Dedekind. 
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In order to make Lagrange’s results more easily accessible, we antici- 
pate a few additional ideas and notations due to Dedekind (see Section 
2.4) and feel free to include some modern terminology. Thus, given a 
field K, we denote, as usual, by K(A) the field obtained by adjoining 
the elements of A, and we write K(ai, for K({ai, ...,o m }). In 

particular, as before, K(X ) or K(x i, designates the field of all 

rational functions in the variables xi,...,x m over K (Lagrange, Ga- 
lois and Dedekind preferred the letter m instead of n). Similarly, if 
S = {si, m , ..., s m>m } is the set of elementary symmetric functions in 
m variables, then the Fundamental Theorem on Symmetric Functions 
may be expressed by the identity Ks(X) = K(S). 

Denote by 9(f) or f9 (!) the rational function obtained from / £ 
K( X) by applying the permutation 8 to the variables. Then, in mod- 
ern language, 

1(f) = {«e S(X) : «(/) = /} 
is the stabilizer of / with respect to the symmetric group 
S(X)~S({l,...,m}) = S m . 

In his famous memoir Reflexions sur la resolution algebrique des equat- 
ions [86] , Lagrange implicitly uses the quasi-order 

/ < s » i(g) £ i(f) 

and calls two rational functions similar if they are equivalent with re- 
spect to that quasi-order, meaning that they have the same stabilizer. 
One of the similarity classes is that of all symmetric functions. 

Another useful ingredient in Lagrange’s theory of polynomial equa- 
tions is what nowadays is termed 

Lagrange’s Subgroup Theorem 

The order (size) of any subgroup of a finite group G divides the order 
of G. 

But what Lagrange really stated in Articles 97-99 of his memoir [86] 
is, rephrased in modern terms, the following more concrete 

Theorem on Minimum Polynomials for Rational Functions 

For f £ K(X), the minimum polynomial (the least monic polynomial 
with root f ) over K(S ) is given by 

m f( x ) = U g efsJ x - 9) 




Adjunctions and Galois Connections 



31 



where fS m is the “orbit” {fd : 0 £ S(X)} = {f{x a ( i), ...,z CT ( m )) : a £ 

Sm }• 

The order of the stabilizer 1(f) is m\/d, where d is the degree of m,f 
and ml is the order of S m ~ S(X). Moreover, 

m f (x) ml/d = rLespc)^ - fO). 

Lagrange did not carry through a general proof but considered a few 
particular cases, among them two-, three- and four-element subgroups 
of S(X) and the case 1(f) = S(X), which amounts to the Fundamen- 
tal Theorem on Symmetric Functions. Conversely, that theorem is the 
base for a rigorous proof of the general statement. How the general 
theorem extends to algebraic elements instead of rational functions, 
and how it provides a perfect adjunction between subgroups and min- 
imum polynomials will be discussed in Sections 2.3 and 2.4, dealing 
with the later progrees made by Galois and Dedekind. 

The essence of Article 104 in Lagrange’s memoir [86] is the following 
remarkable statement (in English translation): 

If t and y are any two [rational] functions ...[in the roots 
of a polynomial equation] and if these functions are such 
that every permutation of the roots which changes y also 
changes t, one can, generally speaking, express y rationally 
in terms oft and ...[the coefficients of the equation]. 

In modern form, that statement reads as follows: 

Theorem on Stabilizer Groups of Rational Functions 

For any two functions t,y £ K(X), the group I(t) is contained 
in the group I(y) if and only if there is some polynomial q over K(S) 
such that y = q(t). Hence 

I(t) C I(y) ^ y<t ^ K(S)(y) C K(S)(t). 

Thus, there is a perfect adjunction between the similarity classes of 
K(X) and the simple field extensions of K(S) contained in K(X). 

The “if” part is obvious. For the “only if” part, Lagrange first assumes 
(in Article 100 of the memoir) that t and y are similar functions in 
xi,...,x m and concludes that if ti,...,t n are the various “forms” ob- 
tained by permutation of the variables in t, the same permutations 
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yield the different forms yi,...,y n of y. Then he remarks that the 
sums 

Mj = YJi= 1 VVi U = n ~ 1 ) 

are symmetric functions, hence rationally expressible by the elemen- 
tary symmetric functions (the coefficients of the general polynomial 
equation of degree m). It is now clear that, solving that system of 
linear equations for yi (given the values tj), one obtains each yi as 
a rational function of all the values ti,...,t n . But Lagrange’s crucial 
observation is that yi is already expressible rationally in terms of the 
coefficients Mj and the corresponding single For that, he carries 
through rather long and complicated calculations, which, admittedly, 
lead to more general statements, also involving equations with mul- 
tiple roots. A rather easy proof (as proposed by Edwards [39]) uses 
Cramer’s rule. But still easier and more elegant is Dedekind’s idea to 
use another fundamental tool invented by Lagrange, namely his inter- 
polation polynomials. How that approach works will be explained in 
Section 2.4. 

Combining the above result with the Theorem on Partially Symmetric 
Polynomials, one already obtains the major part of the Fundamental 
Theorem of Galois Theory for the “generic case” of variable roots: 
namely, a dual order embedding of the subgroup lattice of the sym- 
metric group S(X ) in the closure system of intermediate fields between 
K(S ) and K{X). In order to see that this Galois connection is per- 
fect on both sides (hence an isomorphism), one only has to assure 
that each intermediate field occurs in that way as a simple extension 
- and that remaining statement is just a special case of the Theorem 
on Primitive Elements, due to Abel [1] and Galois [56] (see the next 
section). 

Fundamental Theorem of Galois Theory for Rational Func- 
tions 

The subgroup lattice of the symmetric group S(X) is dually isomorphic 
to the lattice of fields between K(S ) = Ks(X) and K(X). 

The main problem in extending the above result to the general situ- 
ation of “numerical” roots ri, ...,r m of a polynomial equation instead 
of variables is the definition of the group I (/) in that case (where / is 
now an element of K{r \, ..., r m ) rather than a rational function), and 
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the solution of that problem was one of Galois’ major achievements. 

Adjunctions or Galois connections sometimes arise in a somewhat un- 
expected way, for example in group theory. In what follows, we shall 
have occasion to see how Lagrange’s Subgroup Theorem together with 
a suitable adjunction between divisors and subgroups (also called “di- 
visors” by Galois and Dedekind) leads to a basic theorem on cyclic 
groups. As demonstrated by Gauss, Abel and Galois, such groups are 
of central importance for the solvability of certain equations and for 
the theory of finite fields. In particular, the reasoning below yields the 
existence of primitive roots modulo prime numbers, a central tool in 
number theory developed by Gauss. 

Given a finite group G of order n whose neutral element is denoted 
by 1, let L(G) designate the closure system (hence complete lattice) 
of all subgroups of G , and D{n ) the lattice of all divisors of n. By 
Lagrange’s Subgroup Theorem, one has a map 

Ag : L{G) — >■ D(n), H ^ #H, 

assigning to each subgroup its order (cardinality). If G is commuta- 
tive, one also has a map in the opposite direction, 

q g : D(n) — )• L(G), d i — > {g € G : g d = 1}, 

sending each divisor d of n to a subgroup (!) of G. 

For h £ G, let G{h) denote the cyclic subgroup generated by h. If 
G is finite then the order o(h) of G(h) divides the order of G , and in 
particular, h 0 ^ = 1. 

Characterization Theorem for Cyclic Groups 

Given a finite group G of order n, the following statements are equiv- 
alent: 

(a) G is cyclic. 

(b) G is isomorphic to the additive group of integers modulo n. 

(c) A G and g G are mutually inverse isomorphisms. 

(d) A g and g G form an adjoint pair. 

(e) For all divisors d of n, the equation g d = 1 has at most d solutions. 

Proof. The equivalence (a) (b) and the implications (a) =>• (c) =>• (d) 

=^(e) are straightforward: if G is cyclic of order n, say G = G(h), 
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then for each d G D(n), the unique subgroup of G having order d is 
G{h n l d ). 

(e)=>-(a). For d = o(h), we have G{h) Cg G (d), and # g G (d)<d forces 
g G (d)=G(h). Now, put c G {d) = #{heG : o(h) = d}. From (a) 44- (b) 
for d instead of n we infer that g G (d) =G(h) ~ Z^ — Z n (n/d). Thus 

c G (d) = c ea ( d )(d) = ci n {n/d){d) < c Zn (d), 

and so there is an injection l from G into Z„ with o(h) = o(i(h)). 
Since both groups have the same order, t must be onto, and as Z„ 
does have an element of order n, so does G. □ 

The implication (e)=>-(a) applies to any finite subgroup G of the mul- 
tiplicative group of a field K, because the polynomial x n — 1 has at 
most n roots in K. This gives a short proof of the famous 

Theorem on Multiplicative Subgroups of Fields 

All finite subgroups of the multiplicative group of a field K are cyclic. 
In particular, if K is finite then K \ {0} is a cyclic group under mul- 
tiplication. 



2.3 Evariste Galois' Memoir on the Solvability of 
Equations 

Cet Ouvrage n’ayani pas ete compris , les propositions qu’il 
renferme ay ant revoquees en doute , j’ai du me contenter de 
donner, sous forme synthetique , les principes generaux et une 
seule application de ma theorie . Je supplie mes juges de lire 
du moins avec attention ce peu de pages . 

Galois’ preface to the second version of his memoir, 
crossed out by himself but added later by his friend 
Auguste Chevalier in the posthumous publication. 

Perhaps the most famous and controversial paper in the history of alge- 
bra is Galois’ Memoire sur les conditions de resolubilites des equations 
par radicaux , dated January 1831 but not published until 1846 by Liou- 
ville in the Journal de Mathematiques (see [56], or [57] for a later com- 
mented edition, or Edwards [39] for an English translation). Hundreds 
of elaborations, supplements, comments, refinements, and historical 
remarks have been written by later authors about that memoir (see, 
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for example, [139] and the bibliography therein). Almost everybody 
dealing with Galois connections cites the Fundamental Theorem of 
Galois Theory as the “classical prototype” of such connections. How- 
ever, that theorem, stated in its modern form as a dual isomorphism 
between the subfield lattice of a Galois extension and the subgroup 
lattice of the corresponding Galois group, was formulated explicitly 
quite some time after Galois. Surprisingly, when looking at the vaste 
literature on Galois theory, one misses an investigation of the role of 
Galois connections in Galois’ original papers. At least implicitly, a 
few adjunctions and Galois connections already emerge in Galois’ pi- 
oneering memoir - though, admittedly, one will not find any explicit 
order-theoretical terms in Galois’ formulations. However, the natural 
order of fields and groups via inclusion is inherent in the whole theory, 
and so are two other natural orders, namely the divisibility relation be- 
tween numbers and that between polynomials. As mentioned earlier, 
in other writings Galois even used the word “divisor” for subgroups, a 
convention adopted by Dedekind in his Lectures on Algebra [26] (see 
the next section). 

Galois’ memoir starts with a series of “principles” and “definitions” . 
Instead of giving explicit rigorous definitions of fields and groups, Ga- 
lois describes how “rational” or “rationally known quantities” are ad- 
joined stepwise on the way towards the solution of a polynomial equa- 
tion, and he indicates how “groups of permutations” have to be formed 
and investigated for that purpose. 

Given a finite set Z = {a, b, c, ...} of m elements (“letters”), Galois 
means by a “ permutation ” (of the letters) any m-tuple whose entries 
are the distinct elements of Z - or, in modern terminology, any bisec- 
tion between m = {1, ..., m} and Z; while a substitution in the sense of 
Galois (and Dedekind) is a bijection 9 : Z — >■ Z (hence a permutation 
of Z in the usual sense also adopted by Lagrange). Of course, it is easy 
to establish a one-to-one correspondence between these two concepts: 
denoting by P(Z) the set of all “permutations” ir : m — >• Z and fixing 
one such 7r, one obtains bijections 

G : S(Z) -> P{Z), 9^tv9 = 9oi r, 

r-K : S(m) — >• P(Z), a i-» an = %oa, 
which become group isomorphisms if the product of two permutations 
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ip,£ is defined as '0 7r_1 £ (whence n is the neutral element). Thus, 
S m = S(m),S(Z ) and P(Z ) are pairwise isomorphic groups in our 
modern sense. 

Galois gives a double description of his “groups”: first, as collections 
of “permutations” with the following two properties: 

(i) each of the permutations arises from one of them by a [bijective] 
substitution of the letters, 

(ii) the set of substitutions involved is always the same, no matter 
which permutation one starts from. 

Alternatively, Galois takes those substitutions as the constituents of 
his groups and concludes that such collections of substitutions must 
be closed under composition (which by the finiteness of the underlying 
set suffices to assure the existence of inverse elements) . 

The passage between the two group concepts may be formulated in 
the framework of a perfect Galois connection. In order to avoid ambi- 
guities, let us speak in the present context of a grouping block when we 
mean a set of permutations satisfying the above conditions (i) and (ii), 
while subgroups of the symmetric group S(Z) will be referred to as 
substitution groups. Hence, for any grouping block G and 7r G G, the 
set Sq = 7 r -1 Cr is a substitution group not depending on the chosen 
7r, and is referred to as the substitution group ofG. Furthermore, let 
us call two grouping blocks G, H equivalent if there is a a G S m with 
uG = H. By a grouping, we mean a partition of P(Z) into equivalent 
grouping blocks. This is the central tool in Galois’ theory. Now, a 
sequence of straightforward verifications yields: 

The Galois Correspondence Between Subgroups and Group- 
ings 

(1) For a fixed permutation ir, the map T i-> ttT is a bisection between 
the subgroup lattice of S(Z) and the set of grouping blocks containing 

7T. 

(2) The map T i— »■ {7 rT : % G P{Z)} is a dual isomorphism between 
the subgroup lattice of S(Z) and the collection of all groupings, ordered 
by refinement. 

(3) In the opposite direction, the map G 1 — >■ Sq is a surjection from 
the set of grouping blocks onto the subgroup lattice of S(Z), and two 
grouping blocks are equivalent iff they have the same substitution group. 
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A slightly different point of view would be to consider, instead of 
groupings, the associated equivalence relations £ on P{Z): these 
are characterized by the property that 7r £ if entails air E aif for all 
a e S m . Such equivalence relations might be called left congruences on 
P(Z). By the above remarks, they are in one-to-one correspondence 
with the subgroups of S(Z). If, moreover, irOE'ipd for all substitu- 
tions 9 G S(Z) whenever ir E ip, then we could speak of a congruence. 
It is then easy to check that the congruences precisely correspond to 
the normal subgroups of S(Z). On the other hand, the congruences 
are associated with those groupings which have the property that one 
passes from one block of the partition to another by one single sub- 
stitution. And, as Galois discovered, that is just the condition that 
reflects the splitting of a polynomial into factors under the adjunction 
of all roots of an auxiliary equation. Or, in modern terms: under the 
Galois connection between fields and groups, normal field extensions 
correspond to normal subgroups. 

By the above reasoning, there exists a perfect adjunction between 
left congruences on P(Z) and subgroups of S(Z). And in fact, that 
adjunction is induced by an axiality (see Section 1.4) between the 
power sets of P(Z ) xP(Z) and of S(Z), which in turn comes from a 
very natural function between these sets, namely 

F:P(Z)xP(Z)->S(Z), (ir, ip) ^ ir^if. 

The axiality (~*F, F^) sends any left congruence £ C P(Z) x P(Z) 
to the subgroup {ir~ l if : ir E if}, and in the reverse direction, any 
subgroup T of S(Z) to the left congruence { (7r, if) : ir~ l if G T) -- which 
happens to be a congruence just in case T was a normal subgroup. 

After having stated his “principles” , Galois formulates his famous four 
basic lemmas on which he founds his theory of the solvability of poly- 
nomial equations. We summarize them in slightly modernized termi- 
nology. 

Galois’ Four Basic Lemmas 

I. Any algebraic element a over a field K is the root of a “minimum 
polynomial” m a> K that divides any other polynomial vanishing at a. 

II. Given a finite subset Z and an infinite subset A of a field, there is 
a linear form f with coefficients in A such that all values v obtained 
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by substituting elements of Z in that form are distinct. 

III. Any such value is a primitive element of K(Z): K(Z) — K{v). 

IV. Ifv andv' are conjugate (i.e. have the same minimum polynomial) 
then g{v) = 0 O g( v ') = 0 for any rational function g. 

Notice that Lemma II may be shown by an elementary induction on 
the size of Z, without using any higher algebraic tools. After Galois, 
any rational function / with the property stated in Lemma II (ex- 
cept linearity) or its value v obtained by substituting the roots of the 
proposed equation is referred to as a Galois resolvent for the given 
equation. Alternatively, some authors mean by a Galois resolvent the 
minimum polynomial for v. 

Then Galois proceeds to his famous Proposition I, building the basis 
of Galois theory, which he formulates as follows: 

Theorem 

Let an equation be given whose roots are a,b,c,... There will always 
be a group of permutations of the letters a,b,c,... which will have the 
following property: 

1. that each function invariant under the substitutions of this group 
will be known rationally; 

2. conversely, that every function of the roots which can be determined 
rationally will be invariant under these substitutions. 

In a footnote, Galois explains what he means by “invariant under 
these substitutions 

We call here invariant not only a function whose form is 
invariant under mutual substitutions of the roots, but also 
those whose numerical value is unchanged by these substi- 
tutions. 

Galois’ “demonstration” consists of a few typical laconic statements. 
Referring to Lemmas III and IV, he considers all conjugates v = 

Vo , ..., v n -\ of some resolvent v. Then, choosing polynomials <p = cp 0 , ..., <p m -\ 
such that (p 0 (v), ..., (v) are the roots of the original equation, he 

forms the collection of all tuples ( ipi(vj ) : i < m) (j < n ) and con- 
cludes: 

I say that this group has the stated properties. 
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That the so-defined tuples actually are permutations of the roots is an 
easy consequence of Lemma IV, but the second “group” property is not 
so obvious. Most of the proofs given by later authors use the notion of 
field automorphisms - undoubtedly an elegant tool of modern Galois 
theory, but one that is not contained in Galois’ approach. Again, a 
rigorous proof was supplemented by Dedekind (see Section 2.4). 

Galois emphasizes the fact that adjoining new quantities may “reduce” 
the equation inasmuch as the polynomial in question may decompose 
into smaller factors after the adjunction. Of course, in that context, 
the word “adjunction” is to be understood in the field-theoretical and 
not in the order-theoretical sense. Nevertheless, there is also an order- 
theoretical adjunction behind these remarks of Galois, which may be 
described as follows (using the later notion of field extensions). 

The intermediate fields L between a field K and an extension E form 
a closure system T{E : K), hence a complete lattice with respect to 
inclusion. If E is a simple algebraic extension K(a), we shall write 
E(a,K) instead of T{E : K). For each divisor h of the minimum 
polynomial / = m a: K in E[x], the field A a ,K(h) := K(h ) obtained 
by adjoining the coefficients of h to K is a member of E(a, K ) (note 
that formally h is just the sequence of its coefficients). Let A 4(a,K) 
designate the set of all monic divisors h of / in E[x] that are irreducible 
over K(h ) and vanish at a - in other words, the minimum polynomials 
of a over extensions of K contained in E. Then we have a map A Q) ^ 
from M(a,K ) to E(a,K). In the opposite direction, associate with 
any L € E(a,K) the minimum polynomial f? a) #(L) = m a ,L of a over 

L, to obtain a map g a K from J-{a, K ) to A4(a, K). We write g \ h and 
call g a strong divisor of h if g divides h in K(g)[x\. Strong divisibility 
is easily seen to be a partial order on monic polynomials. 

The Perfect Galois Connection for Minimum Polynomials 

For any algebraic element a over K, the pair (X a ,K, Q a ,K) a P er f ec t 
Galois connection between M.{a,K), the set of minimum polynomials 
of a dividing m at K, ordered by strong divisibility, and E(a,K), the 
interval of all fields between K and K{a), ordered by inclusion. Hence, 

M. (a , K) and E(a, K ) are dually isomorphic finite lattices. The degree 
functions 

g : A4(a, K) u> and fi : T{a , K) — >■ lo, 14 [K(a) :L\ 
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are scales with jl = go g and g = jloX. 

Proof. That A and g form a Galois connection between the dually 
ordered sets A i(a,K) and E(a,K) follows from Lemma I: for any 
h £ M(a, K) and L £ E(a,K), we have the equivalences 

A (h) = K(h) C L h £ L[x\ and h(a) — 0 4=> g(L) = m a , L t| h. 

Indeed, g = m a> £, divides h in L[x] iff g divides h in K(g)[x], because 
both are monic polynomials (cf. Gauss’ Lemma). 

In order to see that A and g are mutually inverse dual isomorphisms, 
it suffices (by the Theorem on Perfect Galois Connections, Section 
1.3) to prove the statements about g and jl. These two functions are 
related by 

ML))=M™«,l) = [Ua)-.L\ = [K(a) :L] = ji(L). 

If L C L' and jl(L) = jl(L') then L = L' , because the vector spaces L 
and L' have the same dimension [L:K] = j i(K)/g(L ) = jl(K) / jl(L') = 
[L 1 :K], so that not only g but also jl = go g is a scale. 

As any h £ Af (a, K) is monic, irreducible over K (h) and has the root 
a, it must coincide with the minimum polynomial g(X(h)) = m a ^K{h)- 
Thus, £>oA is the identity map. It follows that g = go go\ = jloX. 
Since J-(a, K ) is a lattice, so is A t(a, K ), and since the latter is finite, 
so is J-(a, K). □ 

From the fact that jl inverts the divisibility relation (by the degree 
formula for fields) and g is antitone, it follows that g — jloX is an 
isotone scale. Thus, the degree of a strong divisor of a polynomial h 
in Af (a,K) is not only smaller than but a divisor of the degree of h 
(cf. Lagrange’s Theorem on Minimum Polynomials!) 

An immediate consequence of the above theorem is the “only if” part 
in the 

Theorem on Primitive Elements 

A field extension E : K is simple and algebraic (i.e. E = K(a) for 
an algebraic element a over K) if and only if the interval T{E : K) 
is finite. For characteristic 0, these two conditions are equivalent to 
finiteness of [E:K\. 

For the “if” part (cf. [90, 98]), one first observes that the degree 
[E : K] must be finite (otherwise, one would obtain an infinite chain 
of intermediate fields). Then, one distinguishes the following cases: 
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if K is finite then so is E, being a finite-dimensional vector space 
over K. Hence, any generating element of the cyclic group E* (see 
the Theorem on Multiplicative Subgroups of a Field) may serve as a 
“primitive element” generating E. If K is infinite then one proceeds 
by induction and assumes in the induction step that E = K(b, c ) 
for algebraic elements b,c over K. If T{E : K) is finite, there exist 
different elements k,k' G K such that K(b + kc) = K(b + k'c). Hence, 
for a = b + kc, it follows that the elements c = (a — b — k'c)(k — k')~ 1 
and b = a — kc belong to K(a), i.e. E = K(a). Third, if m Cj K has no 
multiple roots (which is for sure if K has characteristic 0), a different 
argument works: consider k G K and assume m c>K (c') = 0 = (&') 

for some d ^ c and b' = b + k(c — c') . Since there are only finitely 
many such k = (b 1 — b)(c — c') _1 , we may choose a k G K so that for 
a = b + kc, the polynomials m Ct K and m^Kib + k(c — x)) G K(a)[x] 
have the greatest common divisor x — c in K(a)[x\. Again, we get 
c G K(a) and b = a — kc G K (a), i.e. E = K(a). 

2.4 Richard Dedekind’s Lectures on Algebra 

In einer Vorlesung als Privatdocent in Gottingen behandel- 
ten Sie die damals noch ganz unzugangliche Theorie von Ga- 
lois. Aber bei allem verstdndnisvollen Eingehen auf fremde Art 
wufiten Sie dock stets Ihr Selbst zu behaupten, dem Uberkomme- 
nen Ihr personliches Geprage aufzudrucken. 

Laudatio of the Royal Academy at 
Berlin on Richard Dedekind (1902) 

The Lectures on Algebra alluded to in the quoted Laudatio on the 
occasion of the 50th anniversary of Dedekind’s Ph. D. thesis proba- 
bly contain the first elaborated version of Galois’ theory, including a 
systematic elementary theory of (abstract!) finite groups. Unfortu- 
nately, the unpublished manuscript of these lectures, written some- 
where between 1858 and 1868, had been lost for more than a hundred 
years until its rediscovery by Purkert [116] and the commented edition 
by W. Scharlau in [121] on the occasion of the 150th anniversary of 
Dedekind’s birthday (1981). 

Content and style of the rediscovered manuscript demonstrate the 
power of an ingenious mathematician whose striking ideas and de- 
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ductive accuracy put him far ahead of most, if not all of his contem- 
poraries. Indeed, if some antiquated idioms and long-winded verbal 
formulations in the text would be replaced by modern terms and for- 
malisms, an almost up-to-date exposition of group and field theory 
and other parts of algebra would arise. Unlike Lagrange, Dedekind 
always endeavored to giving concise and elegant proofs, and on the 
other hand, unlike Galois, he did not omit important arguments or 
constructions. The reader of the present survey will already have 
remarked that when one tries to make certain statements of those 
forerunners precise, one has to refer to Dedekind’s work again and 
again. 

In the preparatory Chapter I on groups, Dedekind deals not only 
with permutations, but also with (finite) abstract groups, normal sub- 
groups, and even with factor groups. Following Galois, he speaks of 
substitutions rather than permutations and defines their product ac- 
cording to the (rather modern) convention that the substitution 99' 
first applies 9 and then 9'. Then he proves the associative law and the 
cancellation laws and remarks that these two “Fundamental Laws” 
are the base for all further conclusions, so that they may be used for 
an abstract definition of finite groups, occurring also in many other 
mathematical contexts. (Dedekind is well aware of the fact that the 
finiteness hypothesis is essential for the desired conclusions.) 

As a first highlight, he derives Lagrange’s Subgroup Theorem and 
some of its prominent consequences from the partition 

G = H9 + H9' + H9" + ... 

of a group G into cosets of a subgroup H, mentioned already in Ga- 
lois’ letter to his friend Auguste Chevalier [57]. Since all cosets have 
the same cardinality as H, their number (the index of the subgroup) 
multiplied by the order of H must give the order of the entire group 
G. But Dedekind goes further, analyzes decompositions into two-sided 
cosets 9 H 9' and shows, for example, that the intersection of all conju- 
gates 9~ l H 9 is a normal subgroup ( “eigentlicher Divisor”). Perhaps 
his most modern achievement in the area of group theory is the ob- 
servation that the cosets of a normal subgroup again form a group 
under the “complex product” - a conclusion that requires a degree of 
abstraction entirely alien to other mathematicians of his time. 

In Chapter II, Dedekind develops in a concise and rigorous way the 
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main pieces of Lagrange’s theory of partially symmetric functions and 
their behavior under substitutions. In particular, he proves the funda- 
mental fact that any representative system for the cosets of the stabi- 
lizer I (t) of a rational function t produces all different “forms” of t ob- 
tainable by substitutions of the variables. After a careful proof of the 
existence and uniqueness of the representation of symmetric polyno- 
mials by means of elementary symmetric functions, he derives the Fun- 
damental Theorem on Symmetric Functions, observing that if / and 
h are polynomials with no common divisors so that f /his symmetric, 
then both / and h must be symmetric. But Dedekind does not leave it 
at that but points out that this conclusion requires (among other tools) 
Gauss’ Lemma, and that one may circumvent that tool by the follow- 
ing argument: in the rational function f/h — [f fi---fk)/(hfi---fk), 
where the polynomials fj are the different results obtained from / by 
permuting the variables in /, the numerator is symmetric by defini- 
tion, and so is the denominator hfi...fk — f fi-.-fk • h/ f. 

As mentioned earlier, the symmetric group S(X) of all bijections of 
X = {xi, acts on the polynomial ring I? [A] = R[x i, ...,x m ] by 

9(f)(x 1, -,x m ) = fO(xi, -,x m ) = f(0(x i), ...,9(x m )) for 9 G S(X). 

In his proof of Lagrange’s Theorem on the Stabilizers of Rational 
Functions, Dedekind uses the explicit representation of 

Lagrange’s Interpolation Polynomials 

For any m distinct elements ti,...,t m and arbitrary (not necessarily 
distinct) elements yi, ...,y m °f a field K, there is a unique polynomial 
q G K[x\ with q(tj) = yj for all i = 1, namely 

«M = £”i (*%%,) > where 

pw = nr=i(* - u) P '(x) = E™ , Sj. 

This applies to the following situation. Choose a set of representatives 
Oi =id , ..., 9 m for the “cosets” G9 of G = I(t ) in S(X), so that S(X) is 
the disjoint union of the cosets G9j ( j = 1, ..., m). Then the elements 



tj = 9j ( t ) = t9j G K(X) 
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are pairwise distinct (because tOi = tOj implies G6i = G9j , hence 

i = j)- 

On account of the inclusion I(t) C I(y), the elements yj = 9j(y ) = 
y9j G K(X) satisfy the implication 9{U) = tj =4 9(yi) = yj for each 
9 G S{X). Indeed, 



t9{9 = t9j 
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l el{t) =4 9i99j l £l{y) 44 = y9j. 

Now, the interpolation polynomial q G K{X)[x\ with q(tj) = yj has 
the required property q(t) = q(t i) = y\ = y. From the construction 
it is rather plausible that q is symmetric, but an exact proof requires 
some care. First, one observes that p(x) = XYi = i(x — tf) is symmetric, 
because application of any 9 G S(X) merely permutes the elements L 
(cf. Lagrange’s Theorem on Minimum Polynomials). Hence, 9{p) —p 
and 9(p') = p' . As 9{ti) = tj implies 9{yi) = yj , one obtains 9(q)(x) = 
9{w) e (p)( x ) — n („\ n 

2^=1 {x-9{q))p'{6{q)) - 9\ x )- 



In Chapter III, Dedekind supplements most of the missing proofs in 
Galois’ memoir, but he also makes clear that a great part of Galois 
theory, even in the general case of “numerical roots” , may be developed 
along the lines of Lagrange’s theory. For a better understanding, let us 
restate Galois’ Fundamental Theorem more formally, using Dedekind’s 
concepts and terminology. 

Fundamental Theorem on Fixpoint Fields 

Let f(x) = 0 be a polynomial equation over K with no multiple roots. 
Let Z = {ri,...,r m } be the set of these roots, s a Galois resolvent for 
the equation ( whence K(s ) = K(Z)), and so = s, ...,s n _i its conju- 
gates. Then: 

(1) There are rational functions fj G K(x i, ...,x m ) with ri — fi(s). 

(2) The maps 9j : r* >->■ fi(sj) ( j < n ) form a subgroup G(f) of S(Z). 

(3) An element of K(Z) is invariant under each 9j iff it belongs to K. 

(4) The maps 9j extend to K - automorphisms of K(Z). 

(5) The K - automorphism group of K(Z) is isomorphic to G(f). 

(6) The group G(f) does not depend on the choice of the resolvent s. 

(7) The groups G(f) and G{m Sy K ) are isomorphic. 

All of these statements are proven in Dedekind’s Lectures (however, 
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without explicit mention of the notions of fields and automorphisms, 
and only for the case of characteristic zero). For a careful elaboration 
of some details, see Edwards [39] and Tignol [138]. The group G(f) = 
G(f,K) is referred to as the Galois group of the equation f(x) = 0 
over K. 

Dedekind’s view of the situation is quite modern, emphasizing the 
advantage of group actions on a set: his crucial observation in that 
context is that the Galois group of an equation f(x) = 0 operates on 
the corresponding “splitting field” K(Z). One has to assure that the 
definition of the action of the Galois group on K(Z) = K(ri, ...,r m ), 
given by 

uB = g(n0 , ..., r m 9) if u = g(r u ..., r m ) 

does not depend on the choice of the rational function g (which by no 
means is uniquely determined by u ). To that aim, represent each r* in 
the form ffis) (with fi G K(x)) as a rational expression of the resol- 
vent s. Now, if g(ri, = h(r\, then the rational function 

(g — h)(fi(x ), ..., f r (x)) vanishes at s and consequently also at each 
conjugate s9, where 9 runs through the Galois group. In other words, 
g(ri9 , ..., r m 9) = h(n9 , ..., r m 9). 

Using consequently the machinery of the Galois group action, Dedekind 
extends Lagrange’s Theorem on Stabilizers to the present setting of 
“numerical roots”. Denoting again by I(t) the stabilizer of t in the 
Galois group G(/), that is, the subgroup of all 9 G G(f) with t9 = t, 
one obtains by the same proof as in the case of rational functions: 

Dedekind’s Theorem on Stabilizers 

If s is a Galois resolvent for an equation f(x) = 0, then for any two 
elements t,y of the field extension K(s ) one has the equivalence 

mcl(y) « K(s)(y)CK( S )(t). 

We are now in a position to prove another variant of the Fundamental 
Theorem of Galois Theory that might be rather close to Galois’ own 
ideas, namely an 

Isomorphism Between Subgroups and Minimum Polynomials 

Let s be a Galois resolvent for the polynomial equation f(x) =0 over 
K. Then the subgroup lattice Q(f, K) of the Galois group G = G(f, K ) 
is isomorphic to the lattice M(s,K) of all minimum polynomials for 
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s. The following maps are mutually inverse isomorphisms, hence form 
a perfect adjunction: 

A : Q(f , K)->M(s,K), p H (x) = l[ eeH {x - s6 ) , 
g:M(s,K)^ Q(f, K), p ^ H p = {6 £ G : p(s9) = 0}. 

Proof First, we must verify that both mappings are well-defined. In 
order to see that for each polynomial p £ M(s,K ) the set H p is a 
subgroup of G, consider substitutions 9,9' £ G (and their extensions 
to AT-automorphisms) with p(s9) = p(s9') = 0. If we can prove 
the equation p(s99') = 0 then H v is closed under composition and 
therefore (by finiteness) a subgroup of G. There are rational functions 
t and t' in K(x) with s9 = t(s), s9' = t'(s) and s99' = t(t'(s)). Hence 
we have pot(s) = p(s9) = 0, and as pot belongs to K(p)(x), it follows 
that p = m Sj K( P ) divides the numerator of pot; now, p(s9') = 0 implies 
p(s99') = pot(s9') = 0, as desired. 

For any subgroup H of G, it is clear that pn is a monic divisor of 
Pg = m St K with pn(s) = 0, since H contains the identical substitution. 
For each 9' £ H, we compute 

Ph0'(x) = ILe* (* “ s0e> ) = FUeffO* “ s0 ) = Ph( x )- 
Now, consider any h £ K{pu)[x] with h(s) = 0. As 9' fixes all elements 
of K(ph), it follows that h(s9') = h{s)9' = 0 for each 9' £ H, whence 
Ph divides h. This shows that pn is the minimum polynomial of s 
over K(p H ), be. Ph € M(s,K). 

That the two mappings are mutually inverse is easily checked: on the 
one hand, we have 

Ph p {x ) = El 0 eG:p(s 0 )=o( x ~s9)= p{x) 
for p £ A4(s, K); and on the other hand, for H £ £?(/, K) we get 

H n = {ff e G : UkhW - «0) = 0} = H. 

Finally, the equivalence 
H C H p & V9 eH(p(s9) = 0) ^ p H \\P 
shows that we have a perfect adjunction. □ 

One of the various interesting consequences of the last theorem is 
Dedekind’s observation that the degree of the minimum polynomial is 
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always equal to the order of the corresponding group. 

Composing the previously derived adjunctions and Galois connections 
between groups, minimum polynomials and fields, one obtains the 
Fundamental Theorem of Galois Theory in the form of a perfect Ga- 
lois connection between the subfields of a field extension and the sub- 
groups of the corresponding automorphism group, as established by 
Emil Artin(see [5] and Section 3.1). Let us connect the various Fun- 
damental Theorems of Galois Theory in a diagram. 



subgroups of the r t ^ GALOIS ^ ^ minimum 

Galois group G(f,K) * *Ai(s,K) polynomials 



DEDEKIND 



LAGRANGE 



subgroups of the G{s, K) ' _ ~ T ( -S . K) field extensions 

automorphism group ARTIN contained in K (s) 



2.5 Dedekind’s Order- and Lattice-Theoretical 
Concepts 



Es steht alles schon bei Dedekind. 

Emmy Noether 



It is evident from our introductory remarks about the idea of adjunc- 
tions and Galois connections that the fundamental ingredients in that 
theory are 

(1) order relations, 

(2) isotone (order-preserving) maps, 

(3) closure operations. 

Therefore, it is certainly of interest to search for the first appearance 
of these notions in the literature. 
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As Scharlau communicates in the memorial issue [121], he discov- 
ered in the posthumous works of Dedekind an unpublished prelim- 
inary draft (written probably quite some time before 1860) of the 
Lectures on Algebra, which contains the following remarkable lattice- 
theoretically inspired sentence (translated): 

General Theorem Stippose S\,S2 etc. are fields [“Ge- 
biete”] containing the coefficients of the equation f(x) and 
let G i, G 2 etc. denote the corresponding groups of the equa- 
tion. Then their greatest common divisor [GiDG 2 ] is the 
group with respect to the system S1S2 [the least common 
multiple of S\ and 82]. 

The first structural investigation involving (at least implicitly) the 
above three order-theoretical notions is probably Dedekind’s famous 
treatise on the axiomatics of natural numbers [29] “Was sind und 
was sollen die Zahlen?” (1887). Here Dedekind introduces “systems” 
(something like naive sets) and mappings, and derives in a rigor- 
ous way many elementary properties from a few definitions. Thus, 
Dedekind’s method may be regarded as the first successful attempt to 
develop a partly axiomatic, partly “naive” set theory for the founda- 
tions of mathematics. 

Though Dedekind defines set inclusion by membership of elements, 
he does not introduce an extra symbol for the latter, but uses the 
same symbol -< both for membership and inclusion. Aside from that 
little inconsistency, his development is so careful and precise that a 
few minor changes of derived notions or facts into axioms would pro- 
vide an axiomatic theory of ordered sets, regarding -< merely as an 
order relation rather than as the concrete containment relation. In- 
deed, Dedekind’s initial conclusions (which had to be more verbal than 
formal, because his “systems” are not defined axiomatically) are: 

A -< A (reflexivity) , 

A -< B -< A =>• A = B ( antisymmetry ), 

A -< B -< C =>- A ■< C ( transitivity ). 

Next, he derives the defining rules for the union, alias supremum 
( “composed system”) and intersection, alias infimum ( “community”), 
which in modern notation read as follows: 

A -< B for all A in A 4 =>- B, 
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B -< A for all A in A 44- B -< A. 

Moreover, he observes that if for each A G A there is a B £ B with 
A C B then \JA C and dually for intersections. Of course, in 
the abstract order-theoretical setting, one has to interpret the union 
and intersection symbols as joins and meets. 

Then Dedekind explains what he means by a mapping, its image, the 
restriction to subsets, etc. For a given mapping (p he denotes, as usual 
still today, by p(a) the image of an element a, and by tp{A) the image of 
set A, consisting of all cp(a) with a £ A. In order to avoid ambiguities, 
we employ a slightly different symbol and write p(A) for the latter. At 
a few points, the abstraction of Dedekind’s theory from set inclusion 
to arbitrary order relations requires some care, because not all results 
may be transferred without additional hypotheses. The most impor- 
tant of these hypotheses is the preservation of unions, which is clear 
for the “lifted” maps between power sets, carrying subsets of the do- 
main to their images, while for maps ip between arbitrary ordered sets, 
one would merely require that they be isotone (i.e. order preserving): 

A -< B =4> p(A) -< <p(B), 

but not always that <p should preserve arbitrary joins, which for maps 
between complete lattices is equivalent to saying that ip is a left ad- 
joint. 

After having established the associative law both for the composition 
of mappings and for the binary union operation (but not for binary in- 
tersection), Dedekind considers one-to-one mapppings, which he calls 
“ahnlich” (resembling) or “deutlich” (distinctive), and derives the ba- 
sic properties of such mappings and their inverses. 

The crucial notions for his set-theoretical introduction of the natural 
numbers are then the “self-mappings” (mappings from a set into itself) 
and what he calls “Ketten” (chains), namely, systems K with p>(K) -< 
K. In the general setting, this nomenclature is not very suggestive, but 
of course, what Dedekind had in mind are chains of natural numbers, 
closed under the successor function. Perhaps, he also thought of the 
descending chain 

... <p n (K ) -< ... -< ip{K) -< K. 

Let us distill the order-theoretical essence: given a partial order -< and 
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an isotone self-map p on a complete lattice C, call an element K of C 
(p-) closed if p(K) A K\ as Dedekind proves, the system C v of all such 
elements is closed under arbitrary meets, hence a closure system in the 
set-theoretical case (cf. Section 1.2). Dedekind’s dual conclusion that 
this system will also be closed under unions depends on the hypothesis 
that the power set map p comes from a map on the ground set and 
therefore preserves unions. Order-theoretically, it is easy to show that 
if a map p preserves arbitrary, finite or directed joins etc., then the set 
of all 99 -closed elements is closed under formation of the corresponding 
joins. 

Dedekind denotes the least member K of C v that satisfies A A K by Ao 
or Pq(A). Thus, he introduces here (without using that terminology) 
the closure Aq of A and the closure operator p$ associated with C ^ (or 
induced by p). Dedekind’s crucial discovery is that a closure operator 
Pq enjoys and is determined by the following properties: 

A pq (A) £ Cip and A A K po (A) A K for K £ C 

which say that the corestricted closure map p 0 : C — > is left adjoint 

to the inclusion map ip : C v ^ C. Dedekind also establishes two of the 
three characteristic properties of closure operators, viz. extensivity 
and isotonicity, but not explicitly the third, idempotency. Instead, 
he derives some related formulas and the shortest of all conditions 
characterizing closure operators: 

B A Po(A) Ao{B) A Pq{A). 

It is a historical curiosity that the latter characterization was re- 
invented several times by various authors until the seventies of the 
20th century. 

Fundamental for the classical Galois theory are finite permutations, in 
particular the fact that an injective or surjective self-map of a finite 
set is already bijective. Apparently, that coincidence was assumed 
tacitly by Galois and his contemporaries; as a consequence, through- 
out the whole 19th century, the bijectivity of “substitutions” was not 
postulated explicitly. 

Again, it was an ingenious mental act of Dedekind to realize that a 
proof of such seemingly “obvious” facts requires an axiomatic defi- 
nition of finiteness, not only an intuitive comprehension of the word 




Adjunctions and Galois Connections 



51 



“finite” . And even bolder was his decision to declare finiteness of a set 
S by the postulate that there be no injective self-map from S onto a 
proper subset: 

Ein System S heifit unendlich, wenn es einem seiner echten 
Teile ahnlich ist; im entgegengesetzten Falle heifit S ein 
endliches System. 

In other words, Dedekind characterizes finiteness of S by the property 
that all injective self-maps of S are surjective! But he also observes 
that, in order to derive theorems about finite sets via induction, one 
often needs another definition, namely the existence of a bijection onto 
an initial segment of the natural numbers. Dedekind proves the equiv- 
alence of the two definitions, making use of a weak choice principle, 
later called the Axiom of (Countable) Dependent Choices. Today, we 
know that in set theories without choice principles, the second def- 
inition of finiteness is strictly stronger than the first one, and also 
that the existence of an infinite set is not provable on the base of 
the other elementary set-theoretical axioms. Curiously, Dedekind’s 
“proof” of the existence of an infinite set strongly contrasts with the 
formal proofs for all other statements he formulates before and after. 
At this point, Dedekind uses philosophical rather than mathematical 
phrases and claims that (translated) 

the world S of thoughts, that is, all entities that may be 
the object of reasoning, is infinite; because if s is such an 
element of S then the thought that s can be the object of 
my reasoning, is again an element of S... 

and so on. There exist some hints in the literature that in later years, 
Dedekind himself was not satisfied with that sort of argumentation. 

An interesting question in that context is this: can finiteness of a set 
S also be defined equivalently by the postulate that every surjective 
self-map of S be injective? 

That the answer is in the affirmative is well-known, but probably not 
the following simple proof using a suitable adjunction. It requires, 
however, the hypothesis that the power set VS of a finite set is still 
finite (which is easily verified by induction if the second definition 
of finiteness is accepted). The argument is then as follows. A map 
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(p : S — > S is obviously injective or surjective, respectively, if and only 
if so is the power set map 

-^p:VS^VS, X^{yeS\3xeX {tp{x) = y)}. 

But the latter is left adjoint to the inverse image map 

(p<~ : VS^VS, Y ^ {x e S | 3y e Y (<p(x) = y)} . 

And we know that injectivity of one partner of the adjunction is equiv- 
alent to surjectivity of the other. This shows that the two statements 

• every injective self-map of a finite set is surjective 

• every surjective self-map of a finite set is injective 

are actually equivalent. 

Having discussed a few implicit occurrences of adjunctions and Galois 
connections in the early work of Dedekind, we should now mention 
briefly his pioneering achievements in lattice theory - which certainly 
is an indispensible tool for the general theory of Galois connections. 
The two papers to be cited here are those from 1897 and 1900, respec- 
tively, 

Uber Zerlegung von Zahlen durch ihre grofiten gemeinsamen Teiler 
( On decompositions of numbers by their greatest common divisors) [30] 

and 

IJber die von drei Moduln erzeugte Dualgruppe 
(On the lattice generated by three modules) [31]. 

Much of the material in these two articles has been rediscovered and 
presented by Birkhoff in his fundamental monograph Lattice Theory 
[3]. We find it opportune to confirm here Noether’s “headline” quo- 
tation by pointing out that, if not “everything”, at least a lot of nice 
results (and their proofs) of lattice theory can already be found in the 
two papers by Dedekind. 

The theme in the title of the first paper is a rather specific number- 
theoretical one; probably, it did not receive much resonance among 
algebraists. But that paper contains the two basic concepts of formal 
lattice theory, namely Dedekind’s “Dualgruppen” and what is nowa- 
days called a lattice- ordered group (see Section 4.2). Dedekind accu- 
rately mentions Schroder’s work (see the next section) and appreciates 
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his priority in some of the relevant discoveries; but he also makes clear 
that his own emphasis is on the applications to ideal and number the- 
ory rather than logic and set theory ( “Systemlehre ”) , and that his 
method provides considerable simplification of Schroder’s results and 
proofs. Indeed, Dedekind’s definitions and deductions are of striking 
elegance and precision, though he rarely used order-theoretical argu- 
ments and completely avoided representations by diagrams (perhaps 
a “forbidden tool” at that time). 

We adapt Dedekind’s terminology to the modern common use of lattice- 
theoretical symbols and write V instead of + for the join and A instead 
of — for the meet. Dedekind’s “Fundamental Laws” are (1) the two 
commutative laws, (2) the two associative laws, and (3) the absorption 
laws 

(3') a V (a A b) = a and (3") a A (a V b) = a. 

Thus, Dedekind’s “Dualgruppen” are just what later have been bap- 
tized “lattices” by Birkhoff (Ore called them “structures”). At the 
end of the 19th century, the later meaning of “groups” was not yet 
standard, and some mathematicians used that term for every (or at 
least every finite) semigroup. At that time, the name “Dualgruppen” 
was chosen quite reasonably, pointing to the two commutative semi- 
group structures and their duality (in the sense that every true lattice 
identity entails its dual, obtained by exchanging the two fundamental 
operations). Unlike many other authors, Dedekind does not include 
(as stated by Birkhoff) the idempotency laws 

(4') a V a = a and (4") a A a = a 

in the list of axioms, but he derives them as a first step from the 
absorption laws in the shortest possible way. Then he emphasizes 
that the distributive laws 

(5') (a A b) V (a A c) = a A (b V c) 

(5") (a V b) A (a V c) = a V (b A c) 

(characterizing his “Dualgruppen vom Idealtypus”) imply each other 
and the modular laws (characterizing his “Dualgruppen vom Modulty- 
pus”) 

(6') (a A b) V (a A c) = a A (b V (a A c)) 

(6") (a V b) A (a V c) = a V (b A (a V c)). 
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Moreover, he shows that either of these two laws is equivalent to the 
self-dual identity 

(M) (a V (b A c)) A (b V c) = (a A (b V c)) V (b Ac). 

In order to avoid misinterpretations, it should be mentioned that 
Dedekind has systems of fractional ideals (of a number field) in mind 
when he speaks of the “Idealtypus” , while the lattice of all ideals of 
a ring in general fails to satisfy the distributive laws, though being 
always modular. Next, Dedekind describes, by means of 5 x 5 Cayley 
tables, the smallest non-modular lattice (now known as the pentagon) 
and the smallest modular but non-distributive lattice (the diamond), 
which he concedes “took him some effort”. And Dedekind also was 
the first to show (in the second paper [31]) that modular lattices are 
characterized by the exclusion of pentagon sublattices; the analogous 
and deeper characterization of distributive lattices by exclusion of pen- 
tagon and diamond is due to Birkhoff [3]. But the equivalence of the 
distributive laws to the self-dual “median law” 

(a V b) A (b V c) A (b V c) = (a A b) V (b A c) V (b A c) 

and its proof are already contained in Dedekind’s second paper, too. 
The central subject of his investigation is the free modular lattice 
with three generators, which Dedekind describes by a Cayley table 
covering two pages. Since this 28-element lattice occurs in many later 
investigations of other authors (for example, in the paper [150] by 
Ward and Dilworth to be discussed in Section 4.3), we sketch here a 
diagram whose points carry the original notation used by Dedekind. 




; 4 i 
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That Dedekind rarely used order-theoretical arguments does not mean 
he would not have known the order characterization of lattices. In the 
first paper, he gave a purely set-theoretical description of lattices in 
terms of their principal (dual) ideals, hence implicitly in terms of an 
order relation he called “divisibility” , denoted by > and defined by 

a > b 4=>- a V b = b a Ab = a. 

(Dedekind wrote > instead of < because he had the interpretation 
of divisibility of ideals in mind, which is dual to set inclusion.) In 
the second paper, he explicitly stated the three properties character- 
izing partial orders (reflexivity, antisymmetry and transitivity), and 
observed that the operation V is the “greatest common divisor” with 
respect to divisibility (the join with respect to the dual relation), while 
A is the “least common multiple” (the meet for the dual relation). 

Since a map between complete lattices preserves joins iff it is left 
adjoint, it is evident that (finite or infinite) distributive laws are closely 
related to adjunctions for unary operations. But also modularity can 
be described in terms of certain adjunctions, as we shall see below from 
a fundamental characterization of modularity, again due to Dedekind. 
Two elements a, b of a lattice L form a modular pair, written aMb, if 

(x A b) V (a A b) = (( x Ab) V a) Ab for all x G L, 

or equivalently, x < b implies x V (a A b) = (x V a) A b ; if the dual 

condition holds, one writes aM*b (cf. [3]). The interval [a,b] consists 
of all elements x with a < x < b. Now a straightforward verification 
yields the so-called 

Transposition Principle for Intervals 

For any two elements a,b of a lattice L, the maps 

f) a : [a A b, b] -A [a, a V b] , x \ -A x V a and 

(fib : [a, a V 6] — > [a A b, b ] , y \-A y Ab 

form an adjoint pair. Furthermore, 

aMb xf) a is injective <pb is surjective, 

aM*b (ft, is injective if a is surjective. 

Hence, L is modular iff the adjunctions (f) a , <pb) are always (left, right) 
perfect, in other words, ip a and ip>b are always mutually inverse isomor- 
phisms. 
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For finite modular lattices, Dedekind concludes that all maximal chains 
have the same length, and that the degree ( “Stufe”) or height func- 
tion a , associating with any element a the length of a maximal chain 
between the least element and a, satisfies the identity 

a (a) + a(b) = a {a A b) + a (a V b). 

Of course, if a and b are related and a (a) = cr(b) then a = b, so that a 
is in fact a scale in the sense of Section 1.3. Conversely, the existence 
of a function with these two properties forces modularity (cf. Birkhoff 
[3] and von Neumann [145]). Indeed, in a pentagon with atoms a,b 
and a coatom c above a, 

a (a) +a(b ) = u{aAb) +a(a V b ) = a{cAb) + <r(cV b) = <r(c) +<j(6), 
hence a(a) — <r(c), while a < c. 

A further adjunction, not contained explicitly in Dedekind’s work but 
closely related to the above one, concerns 

Distributivity and Product Representation of Lattices 

For any two dements a,b of a bounded lattice L, the maps 

A : [0, a] x [0, b] — > L, (u, v) h-* u V v and g : L — > [0, a] x [0, b] , x f-A 
(a A x, b A x) 

form an adjoint pair. Furthermore, (A, g) is a perfect adjunction iff 
(a, b) is a bi-modular, complementary and distributive pair, i. e. 

aMb M a, a Ab = 0 and x = (o A x) V (b A x) for all x € L. 

Every direct product representation L ~ x L 2 is, up to isomorphism, 
of the above form. In particular, any complementary distributive pair 
of elements in a modular lattice gives rise to a direct product repre- 
sentation, and conversely. 

This result is of importance, for example, in the famous decomposition 
and coordinatization theory for complemented modular lattices due to 
Garrett Birkhoff and John von Neumann (see [3] and [145] for details). 




Adjunctions and Galois Connections 



57 



2.6 Ernst Schroder’s Algorithms and Calculi 

Schroder’s work may be seen as the ultimate fulfilment of the 
endeavour of logicians over half a century to evolve an adequate 
treatment of the formal side of logic. 



G.T. Kneebone 

Ernst Schroder probably was not only the inventor of the first semi- 
formal system of lattice theory (perhaps even before Dedekind), but 
also the first mathematician to investigate the interplay between logic 
and (a restricted form of) universal algebra, involving certain Galois 
connections. Although he did not use that terminology nor the precise 
mathematical definition of such connections, they occur implicitly in 
at least three problem areas discussed extensively by Schroder in his 
monographs, articles and lectures [127] - [131]: 

(1) The role of the distributive law and its (non-) provability in his 
“logical calculus”, the first axiomatic approach to lattice theory. 

(2) The foundation of (parts of) universal algebra and its links to 
lattice theory, exemplified by the equational theory of quasi- 
groups. 

(3) The idea of extents and intents of concepts, leading a century 
later to the formal concept analysis initiated by R. Wille [59, 
153], 

In honor of Schroder’s pioneering work in the area between mathe- 
matics, logic and philosophy, a modern institution founded by Wille, 
dealing both with philosophical and mathematical aspects of formal 
concept analysis, its applications and related theories, was baptized 
Ernst- Schroder-Zentrum fur Begriffliche Wissensverarbeitung. 

Schroder’s early investigations of certain systems of axioms and pos- 
tulates building the framework for the later theory of lattices will 
be discussed more thoroughly in Section 4.1. May it suffice for the 
moment to point out that, in his own later work, Schroder carefully 
distinguished between the “identical calculus” (or “calculus of iden- 
tities”), the earliest manifestation of a systematic theory of Boolean 
logic and algebra, and the “logical calculus”, which, in the terminology 
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of the 20th century, is lattice theory combined with certain semantic 
interpretations by classes, statements, conclusions, etc. 

Schroder was quite proud of having found a counterexample that dis- 
proves the general validity of the distributive law in the “logical cal- 
culus”. For his debate with Peirce about that point, see Section 4.1.) 
The counterexample alluded to, though outdated soon by Dedekind’s 
much simpler construction (see the previous section), opened a pro- 
found insight into the axiomatics of lattice theory. But it is also of 
interest with regard to the development of Galois connections, because 
it stemmed from Schroder’s theory of “algorithms”, perhaps the first 
historical instance of an equational theory and its models, and there- 
fore of a class-theoretical Galois connection induced by the relation of 
satisfaction. He considers a special type of “functional equations” for a 
multiplicatively written operation, together with two other operations 

” and “/” so that 

ab = c 4=>- c = a:b 4^ b = a/c. 

Thus, he deals with nowadays so-called quasigroups. Observe that the 
above rules just define residuated groupoids (see Chapter 4) with the 
identity relation as order. Schroder deduces the “principles” 

b = a(b : a) = ( b/a)a = a/ (a : b) = a : ( a/b ) = ( ba)/a = (ab) : a 

and considers the finite set U of all identities derivable from these prin- 
ciples and containing exactly three - possibly equal - variables on both 
sides. By a “group of formulas” or an “algorithm” he means “a system 
of formulas in U that does not imply any formula in U (by virtue of 
the principles) that it does not already contain.” In other words, using 
modern terminology, an “algorithm” is a deductively closed system or 
an equational theory of quasigroups. Schroder points to the difference 
between the [closure] system of all algorithms as a model for his “log- 
ical calculus” and the system of all subsets of U (which is a model for 
Boolean algebra, his “identical calculus”). 

By virtue of the Galois connection induced by the relation of satis- 
faction, M \= E, meaning that M is a model for E (or satisfies the 
equation E), the model classes for sets of equations always form a clo- 
sure system and consequently a (complete) lattice. This is essentially 
what Schroder has in mind when he considers his “groups of formulas” 
as the “members of a logical calculus” , while the notion of “closure” 
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seems not to occur explicitly in that work. Again, it becomes evident 
that the word “group” had a very vague meaning at the end of the 
19th century. Later, Schroder wrote in his Lectures on the Algebra of 
Logic [131, p.630] (free English translation): 

Given a system of things, called “elements” , and a process 
deriving new elements from the given ones, we may add 
those new elements to the system ... [and after iteration 
of that process] the given elements together with the de- 
rived ones form a system, which we may call the “group ” 
determined by the former elements. 

Thus, Schroder’s “groups” essentially are systems closed under certain 
processes, hence closure systems or, in the more specific realm of alge- 
braic operations, systems of subalgebras. The fundamental idea that 
extending the system of algorithms, axioms or rules decreases the sys- 
tem of “groups” and vice versa, inherent in Schroder’s work, opened 
the way to the class-theoretical Galois connections between equational 
theories and varieties of modern universal algebra. 

Whereas the logical product of “algorithms” (or “groups”) is simply 
their intersection, the logical sum is not the union but the collection 
of all formulas derivable from those in their union by virtue of the 
‘principles’. In other words, the sum is the Galois closure of the union 
with respect to satisfaction. 

Schroder disproves the distributive law by what he calls “exemplifica- 
tion”. The counterexample he gives (together with a Venn diagram) 
is a proper inclusion 

AB + CB c {A + C)B 

where A and C are the algorithms of associative and commutative 
quasigroups, respectively, and B is a certain algorithm consisting of 
consequences of the identity ab = a : b = a/b. Admittedly, as Hunt- 
ington said, that example is rather complicated (e.g. compared with 
that given by Dedekind), but for Schrdder it was both natural and 
substantial in view of his elaborated theory of quasigroups. In a later 
edition, he cited three further counterexamples due to other authors, 
two of which are of interest from the perspective of later developments 
of lattice theory and its connections with geometry. 
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One of these examples, due to Voigt [144], may be regarded as a typical 
“context” (as considered in formal concept analysis, see Section 1.5), 
hence as an instance of a Galois connection. The “symmetry relation” 
between quadrangles and orthogonal maps rise to a non-distributive 
(and not even modular) “concept lattice” , consisting of seven concepts: 



square 

□ 



rect angle , 



rhombus 



trapezoid\ parallelogram^ j_kite 



quadrangle 



However, Voigt and Schroder did not consider that whole lattice, but 
argued that the symmetry group of the square, which is the concep- 
tual synthesis of the rectangle and the rhombus, contains not only the 
axial symmetries of these two “constituents” , but further symmetries 
(obtained by rotation). Hence, the symmetry groups of the square, the 
rectangle, the rhombus and the parallelogram together with the group 
of rotations form a non-distributive (but modular) lattice. This “di- 
amond” lattice may be regarded as a sublattice of the closure system 
of all subgroups of the dihedral group D 4 of order 4x2 = 8 (which are 
represented by the incidence-preserving permutations of the vertices of 
a square). Notice that the cyclic group generated by the permutation 
(1234) cannot be represented as a symmetry group of quadrangles, 
because a quadrangle fixed by that permutation must alredy be a 
square and has therefore the full symmetry group D 4 . But the whole 
subgroup lattice contains a lot of further 5-element non-distributive 
sublattices, among them also non-modular ones. 
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The other example is due to Korselt [79]. The elements are the points, 
lines and planes of 3-dimensional euclidean space, together with the 
empty set and the whole space - in other words, the affine subspaces of 
R 3 . Since these objects are closed under arbitrary intersections, they 
form a complete lattice, ordered by containment. This closure system 
consists of all Galois-closed sets with respect to the incidence relation 
between points and planes. Of course, the join of two affine subspaces 
is the least affine subspace containing both, and this is almost never 
their union. A plane, two parallel lines contained in it, a point on 
one of the lines, and the empty space form a pentagon sublattice, so 
that the whole lattice cannot even be modular (though being upper 
semimodular). Later, Korselt modified his example to a projective 
geometry [80, 81]. Thus, he made a first step towards the lattice- 
theoretical development of (projective) geometry. 

Let us conclude this section with a quotation (in free English trans- 
lation) from Schroder’s “Operationskreis” . It concerns the “concept 
of concept”, fundamental for the philosophical foundation of formal 
concept analysis (see Section 1.5): 

The linguistic expression of a class usually is a common 
name and gives rise to the foundation of a concept, in 
which the essential attributes shared by all individuals be- 
longing to the class are thought of being comprised. In 
contrast to those attributes, the so-called “intent” of the 
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mentioned concept, the class itself represents its “extent”, 
so that computing with class symbols actually means to deal 
with the concepts represented by their extent. 

Since the passage from extents to intents turns the whole concept 
lattice “up-side down” , this approach to lattice theory makes the du- 
ality principle particularly evident. And it is the merit of Schroder 
to have pointed out that duality, which was missing in Boole’s earlier 
development (Section 4.1). 

2.7 David Hilbert's Nullstellensatz 

Sie haben da ein unermefiliches Feld mathematischer Forschung 
erschlossen, welches als ‘Mathematik der Axiome’ bezeichnet 
werden konnte und weit iiber das Gebiet der Geometrie hin- 
ausreicht. 



Hurwitz in a letter to Hilbert, 1903 

Hilbert’s axiomatization of geometry [74] is closely related, via inci- 
dence relations, with certain Galois connections that also played a role 
in the later lattice-theoretical developments of geometry by Menger 
[94, 95, 96], von Neumann [145], Birkhoff [12, 14] and others. We 
shall address here a related fundamental discovery due to Hilbert that 
is based on a very natural Galois connection, namely his famous “Null- 
stellensatz”. 

One of the most frequently occurring types of relations leading to 
interesting Galois connections is the “zero relation” or “vanishing re- 
lation” 

f N z <£> f(z) = 0 

between functions / from a fixed structured set S into a structured set 
T having a “zero element” 0, and elements z of S. For example, one 
may consider a topological space S and the ring C(S) of all continuous 
functions from S into the reals. The interplay between topological 
and algebraic properties in that context proved extremely fruitful, as 
documented, for example, in the book Rings of Continuous Functions 
by Gillman and Jerison [63]. Let us mention only one aspect of Galois 
connections in that context. The partners of the Galois connection 
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($, \&) associated with the above zero relation _L send any subset Y 
of the space S to the ideal $(F) = Y± of all continuous functions 
vanishing on Y, and in the opposite direction to each set F of functions 
in C(S) its zero set 

W(F) = F ± = {zeS : f(z ) = 0 for all / G F}. 

By continuity of the maps, each zero set is closed; but when are all 
closed subsets of the space S Galois closed, i.e. zero sets? Necessary 
and sufficient for that coincidence is that for any closed set Y and any 
point z outside Y, there be a continuous real function vanishing on Y 
but not at z. In other words, the space must be completely regular. 
Thus, the aspect of Galois connections makes it evident why com- 
pletely regular spaces play a central role in the theory of continuous 
real functions. 

Algebraic versions are obtained by considering endomorphism rings or 
polynomial rings instead of rings of continuous functions. A rather ele- 
mentary but useful tool of linear and geometric algebra is the following 
result, obtained from the Dual Isomorphism Theorem for Submodules 
(see Section 1.4) by passing from endomorphism rings to matrix rings 
via the selection of bases: 

Galois Connection Between Ideals and Subspaces 

Let A denote the endomorphism ring of a finitely generated vector 
space B. The polarity associated with the vanishing relation between A 
and B induces a dual isomorphism between the lattice of all (principal) 
left ideals of A and the lattice of all subspaces of B. 

Now, let us turn to the central theme of this section and consider an 
algebraic closure C of a field K. For the ring K[x] of polynomials 
in one variable over K, the situation is easy to describe. For each 
subset Y of C, the polynomials having all elements of Y as roots form 
a (principal) ideal. Hence, there is a unique monic fy G K[x] dividing 
precisely those polynomials which vanish on Y. If Y is finite then fy 
is the product of the distinct minimum polynomials of elements in Y ; 
otherwise, fy is the zero polynomial. Denoting by Zf the zero set of 
any / G K[x\, we have the equivalences 

fy divides / 44- f(y ) = 0 for all y G Y Y C Z;, 
showing that the maps 
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$ : V{C) K[x], f Y and : K[x\ -> P(C'), / ^ Z 7 

form an adjoint pair, where K[x] is ordered by divisibility and the 
power set V(C) by inclusion. A subset of C is dosed under conjugation 
iff it contains with ceC all other roots of the minimal polynomial of c 
over K. The above adjunction then induces an isomorphism between 
the closure system of all subsets of C that are either finite and closed 
under conjugation or equal to C, and the lattice of all monic squarefree 
polynomials over K. 

Focussing (like Galois) on one particular equation f(x) = 0, where 
/ is separable (has no multiple roots), one obtains an isomorphism 
between the closure system of conjugation closed subsets of Zj and 
the finite lattice of all monic divisors h of / that are irreducible over 
K(h). 

More interesting and of fundamental importance for algebraic geom- 
etry is the case of a polynomial ring R n = K[xi, ..., x n ] in several 
variables. Given a subset F of R n , denote by V(F) its common zero 
set, i.e. 

V(F) =F ± = {c= ( Cl , ..., c n ) G C n : /(c) = 0 for all / G F}. 

In algebraic geometry, such zero sets are referred to as ( algebraic ) K- 
varieties. The radical of an ideal I in a commutative ring R with unit 
is the set 

y/l = {/ G R : f m G / for some m}. 

A radical ideal is an ideal I that coincides with its own radical, which 
is tantamount to saying that f 2 G I implies / G I. 

There is a beautiful Galois connection relating these two notions and so 
building an important bridge between algebra and geometry. First of 
all, note that V(F) coincides with V(I), where I is the ideal generated 
by F. The equations V(0) = C n and V(F) U V(G) = V(FG) show 
that the closure system of all K-v arieties is closed under finite unions, 
hence topological. The corresponding system of open sets is called the 
Zariski topology on C n . 

Isomorphism Theorem for Algebraic Varieties 

Let C be an extension of the field K and R n = K[x i, ..., z n ] the poly- 
nomial ring in n indeterminates over K. The following statements 
are equivalent: 
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(a) For each n, the Galois connection associated with the zero rela- 
tion between R n and C n induces a dual isomorphism between the 
closure system of K -varieties in C n and that of radical ideals of 
R n . 

(b) For each n and F C R n , the zero set V(F) is nonempty if (and 
only if) F generates a proper ideal. 

(c) For each n, the Galois closure of any ideal I of Rn is its radical 

Vi. 

(d) For each n and all ideals /, J C R n> V(I) C V(J) implies \fj C 

Vi. 

(e) For each n and F,G CR n with V(F) C.V(G) there is an r such 
that each product of r factors in G belongs to the ideal generated 
by F. 

Proof (a) =>■ (b). Let I = ( F ) denote the ideal generated by F. If 
V(F) = V(J) = V(Vl) = 0 then by the dual isomorphism, \fi = R n , 
i.e. 1 e I = R n . 

(b) =>■ (c). Let / be any ideal of R n . If f m G / then clearly / van- 
ishes on V(I), which means that / belongs to the Galois closure of L 
Conversely, if the latter is the case, consider the ideal / generated by 
I[x] U {1 — xf} in the polynomial ring R = R n [x). If 1 0 / then by 

(b) , there would exist a (y, z) € C n x C ~ C "+ 1 with g(y, z) — 0 for all 

g G /, in particular g(y) = 0 for all g G I and 1 — zf(y) = 0, contradict- 
ing the hypothesis that / belongs to the Galois closure of / (and there- 
fore f(y) = 0). Hence, 1 G /, say 1 = g + (1 — xf)h for some g G I[x\ 
and h G If / is n °t the zero polynomial, it is invertible in the 

quotient field of R n . Substituting Iff for x, one obtains 1 = g(l/ f), 
and for the degree m of g, it follows that f m G /. (This idea is due to 
Rabinovich [117]). 

(c) =t*(e). Since V(G) = V((G)) and (G) is finitely generated by 
Hilbert’s Basis Theorem (see [72], [90] or [142]), one may assume that 
G is finite, say #G = k. Now, V(F) C V(G) implies G C VW) b y ( c )- 
Hence, by finiteness of G, there is an exponent s such that g s G (F) 
for all g G G, and for r = sk it follows that g\...g r G (F) whenever 
gi,...,g r G G (because one factor must occur at least s times). 

The implications (e)=>(d) and (d)=^>(a) are straightforward. □ 
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The radical ideals are precisely the intersections of prime ideals, and 
the latter correspond to the (union-) irreducible A- varieties via the 
dual isomorphism induced by the zero relation. 

That the situation described above frequently occurs in practice (for 
example, when C is the field of complex numbers) is assured by 

Hilbert’s Nullstellensatz (Zero Set Theorem) 

The equivalent conditions in the previous theorem are fulfilled if C is 
algebraically closed. 

Various rather short proofs of that theorem are to be found in the 
modern standard literature on algebra and algebraic geometry (see, 
for example, [84] or [90]). Historically, it has to be noted that in his 
famous 1893 paper “Uber die vollen Invariantensysteme” [73], Hilbert 
formulated the Nullstellensatz in a slighly different form, namely as 
condition (e) (which often is lost in modern presentations) for ho- 
mogeneous polynomials and the complex number field C. Actually, 
Hilbert’s primary purpose was the application to the theory of invari- 
ants. Some of his concepts come close to the idea of Galois closure, 
when he considers for a given set of (homogeneous) polynomials the 
collection of all polynomials vanishing on the common zero set of the 
given ones. However, the notion of Galois connections does not arise 
explicitly in that context. 

For finite fields , the situation becomes particularly simple. 

Theorem on the Polynomial Completeness of Finite Fields 

For a field K, the following conditions are equivalent: 

(a) K is finite (a Galois field). 

(b) For any a G K n there is a polynomial f a G K[x \, ..., x„] with 
faifl) = 1 and f a (b ) = 0 forbe K n \ {a}. 

(c) Any function g : K n — t K is a polynomial function. 

(d) Each subset of K n is the zero set of a polynomial. 

Hence , for any finite field K, the Galois connection associated with the 
zero relation between K[xi , ..., x n ] and K n is right perfect. In other 
words, there is a dual isomorphism between the power set of K n and 
the closure system of all radical ideals of K[xi, ...,x n ], under which 
the singletons correspond to the prime ideals (by associating with any 
point a G K n the ideal of all polynomials vanishing at a). 
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For (a)=>(b), use the modified interpolation polynomials 

fa{x 1 , Xn) = n;=i riceir\{aj} ^r c for a = (oi, a n ) E K n . 

For (b)=>(c), take the polynomial / = ^2 aeK n <?(a)/ a to obtain f(b) = 
g(b ) for all b E K n . The implications (c) =^(d) =^(a) are clear. 

A modern algebraic variant of the correspondence between varieties 
and radical ideals should be mentioned at the end of this section, 
because it involves another natural Galois connection. Let R be an 
arbitrary commutative ring with 1 and S(R ) its spectrum , the set of all 
prime ideals, equipped with the hull-kernel topology (also sometimes 
referred to as the Zariski topology ). The closed sets for this topology 
are the collections 

$(£) = {PE S{R) \BCP} {BCR) 

and they are in fact the Galois-closed subsets of S(R) with respect to 
the membership relation E between elements and prime ideals of R. 
On the other hand, the Galois-closed subsets of R are the intersections 
of prime ideals, and these are known to be precisely the radical ideals. 
(For the latter conclusion, one needs Krull’s Separation Lemma, see 
Section 4.2). Thus, we have the 

Isomorphism Theorem for Radical Ideals 

The polarity associated with the membership relation between elements 
and prime ideals of a commutative ring R gives rise to an isomorphism 
between the closure system of radical ideals and the hull-kernel topology 
on the spectrum. Under that isomorphism, the prime ideals correspond 
to the irreducible closed sets. 

The similarity between the last theorem and that on algebraic varieties 
is not casual, but there is an explicit link via continuous functions. For 
any element z of C n , the polynomials in R n = K[x i, ...,x n ] vanishing 
at z form a prime ideal P z , and the function z h->- P z is continuous 
as a map from C n with the Zariski topology to S(R n ) with the hull- 
kernel topology. Indeed, in modern terminology, the latter space is the 
sobrification or sober reflection of the former. Here, a space is sober 
iff each irreducible closed set is the closure of a unique point (see e.g. 
Johnstone [78]). Any topology is lattice-isomorphic to that of the 
sobrification. In the present situation, both topologies are isomorphic 
to the system of all radical ideals in R n . 
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3 Galois Connections in the Twentieth Century 

Die “abstrakte” , “formale” oder “axiomatische” Richtung , der 
die Algebra ihren erneuten Aufschwung verdankt , hat vor allem 
in der Gruppentheorie, der Korpertheorie, ... und der Idealthe- 
orie ... zu einer Reihe von neuartigen Begriffsbildungen , zur 
Einsicht in neue Zusammenhdnge und zu weitreichenden Re- 
sultaten gefuhrt. 

From the introduction to 
van der Waerden’ s Algebra 

3.1 From Dedekind to Noether, Artin and van der 
Waerden: 

The Modern Galois Theory 

Richard Dedekind was one of the first mathematicians to present the 
Galois theory in its modern (field-theoretic) form. As we saw, great 
parts of that theory are already contained in his unpublished Lectures 
on Algebra [26]; others are to be found in his celebrated Supplement 
XI to Dirichlet’s Lectures on Number Theory [27]. A concise and well 
readable survey on the state of Galois theory at the beginning of a 
new century is Dedekind’s 1901 paper [32]: 

Uber die Permutationen des Korpers alter algebraischen Zahlen {On 
the permutations of the field of all algebraic numbers) 

which opened the door to the algebra of the twentieth century, includ- 
ing already first investigations on infinite and normal field extensions. 

Ernst Steinitz [136], Emmy Noether [102], Emil Artin [5] and B.L. 
van der Waerden [142] are prominent representatives of the modern 
algebra in the first half of the twentieth century. They worked in the 
tradition of Dedekind and brought Galois Theory to a new culmination 
point. 

Below, we collect together those ingredients of the Fundamental The- 
orem of Galois Theory that can be found in Dedekind’s later work. 
Then we shall point out how Galois connections simplify the consid- 
erations. We also include some of the ingenious ideas contributed by 
Artin. 




Adjunctions and Galois Connections 



69 



By a “permutation” of a subfield K of the field C of complex num- 
bers, Dedekind means in [32] a monomorphism 7r from K into C (that 
is, an isomorphism onto a subfield of C). Dedekind uses here the 
modern notation an for the image of a under n. Of particular inter- 
est from the modern order-theoretical perspective is his idea to order 
not only the fields by the subfield (“divisor”) relation, but also the 
monomorphisms, calling a monomorphism ip : K — > C a “divisor” of 
a monomorphism ip : L -» C (and xp a “multiple” of <p) if <p is ob- 
tained from xp by restriction. He observes that with any system P 
of such monomorphisms one may associate the subfield K = $(P) of 
all elements on which these monomorphisms coincide, and that the 
common restriction of all n E P to K is then the greatest common 
divisor (the so-called “remainder”) of P. Next, Dedekind cites two 
important theorems from his Supplement [31]. 

Theorem on Permutations of Finite Field Extensions 

If the field M is a finite extension of the field K and n : K — >■ C is any 
monomorphism then the set M n of all multiples of n with domain M 
satisfies K = n is the remainder of M n , and the degree [M :K] 

is its cardinality. In particular, if H is the set of all monomorphisms 
from M into C fixing K pointwise then K in turn is the fixpoint set 
of H, and 

#H=[M:K], 

Thus, in modern terminology, the Galois connection between sets of 
monomorphisms from M into C and fields K having M as a finite 
extension, induced by the fix relation ip ~\ a <3- <p(a) = a is right 
perfect. 

First Theorem on Finite Automorphism Groups 

If G is a finite group of automorphisms of a field M and K = $(G) 
is the field of its common fixpoints, then M is a finite extension of 
K whose degree is the order of G, and G is the automorphism group 
T(K) of M overK: 

[M:K} = #G and G = T(K) = T($(G)). 

The second equation follows from the first by the general theorem 
in Section 1.3. In the booklet Galoissche Theorie [5], Artin gives a 
nice proof of the last theorem, using basic tools of linear algebra and 
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another theorem due Dedekind, namely the 

Character Independence Theorem 

Any set of homomorphisms from a semigroup into the multiplicative 
group of a field is linearly independent. 

Artin’s proof of the Theorem on Automorphism Groups shows that 
the equation [M : K] = includes the infinite case, saying that the 
group G is infinite if and only if so is the extension M : K. The proof of 
the Character Independence Theorem is a quick but tricky induction 
which we omit here. 

Consider ai,...,a n £ G and fq, ..., b m , ci, ..., c n £ M such that the bj 
are linearly independent but satisfy Gi(bj)ci = 0 for j = 1, m. 
Then, for any a = a jbj £ M with aj £ K, 

Ellice = Er=i%Er=i^(^')c = 0. 

Thus, if the bj generate M, all coefficients c* have to be zero, by the 
Independence Theorem. This shows that n cannot be greater than m, 
hence )fG < [M : K], including the fact that if G is infinite then so is 
[M : K}. 

We may now assume that G = {cti, ..., a n }. If n < m, there is a 
non-zero vector (di, £ M m with 

J2jLi cr i~ 1 (^j)d‘j = 0 f° r * = 1 

Again by the Independence Theorem, the trace a = EiLi ai ^ oes 
not vanish everywhere, so one may assume cr(dk) ^ 0 for some k. 
Application of cq and summation over i yields the identity 

o = Eiu Er= i = sr=i b i 

But the values cr(dj) are fixed by the automorphisms cq, hence ele- 
ments of K , which however contradicts the linear independence of the 
bj. Therefore, the dimension (degree) of M over K is exactly n. 

An extension M : K where K is the fixpoint field of some finite set P 
of automorphisms of M is referred to as a (finite) Galois extension. 
The subsequent equivalent characterizations of such extensions can be 
found in any modern textbook on classical algebra (see, for example, 
[ 5 , 90 , 98 , 142 ]). 

Characterization Theorem for Galois Extensions 

For a finite field extension M :K, the following are equivalent: 
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(a) M:K is a Galois extension. 

(b) [M: K\ = #r(K). 

(c) M:K is normal and separable. 

(d) M is the splitting field of a separable polynomial over K. 

In Dedekind’s studies, separability plays no role since he confines his 
considerations to subfields of C. However, he is aware of the difficul- 
ties arising in the theory of infinite extensions; here, he shows (by a 
countability argument) that every monomorphism on a subfield of a 
field A of algebraic numbers has a “multiple” (that is, a monomorphic 
extension) with domain A. Next, he explains why the above theorems 
comprise the essence of Galois theory. Starting with a finite group G 
of automorphisms of the field M and the fixpoint field K = $((?), he 
formulates the crucial one-to-one correspondence between the inter- 
mediate fields of M : K and the subgroups of G. Using the modern 
language of Galois connections, one may rephrase his explanations as 
follows (see [5] or [98]): 

Fundamental Theorem of Galois Theory 

For any Galois extension M : K, the polarity of the fix relation induces 
mutually inverse isomorphisms $ and T between the closure system 
Q = Q{M:K) of all subgroups of the Galois group G = P(i’f) and the 
closure system J- = of all subfields of M containing K. 

Let us see what is needed for the proof. We have the following four 
maps: 

the inclusion inverting map 

H^{aeM : VaeH (a (a) = a)}, 

its (dual) Galois adjoint 
T : JF — > C?, L {cr E G : a£ L (a (a) — a)}, 

the isotone ordinal scale into N = o;\{0}, ordered by divisibility, 

: G -4 N, 

and the antitone degree scale 

Ajr : ^-^N, L i— > [. M:L ]. 

By the equivalence (a) (g) in the Theorem on Perfect Galois Con- 

nections (Section 1.3), the only two details that have to be checked 
are the identities = A g and Ago]? = A?. But these are just 

the above equations 
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[M :$(#)] = ## and #r(L) = [M:L\. 

The first equation was established in Artin’s proof of the First Theo- 
rem on Finite Automorphism Groups. One obstacle arises in the ver- 
ification of the second equation: one must know that whenever M : K 
is a Galois extension then so is M : L, for every L 6 T{M : K). Of 
course, one may solve that problem by invoking the Characterization 
Theorem and observing that normality and separability are inherited 
by the intermediate extensions M : L. But the proof of those facts is 
not easy and requires a lot of extra definitions and tools. If one wishes 
to circumvent that difficulty, one may alternatively use the following 
result due to Artin: 

Second Theorem on Finite Automorphism Groups 

Let G be a finite group of automorphisms of a field M. Then each 
intermediate field L between <f>(G) and M satisfies the equations 

[G:r(L)] = [L:$(G)], [M : L] = #r(L), 

and M :L is a Galois extension. 

For the proof, put K = $(G) and H = r(L). The first step is to prove 
the inequality 

[G:H] < [L:K], 

To that aim, Artin considers the set S = {a\ L : cr £ G} and observes 
that two automorphisms a,r £ G coincide on L iff Ha = Hr. Thus, 
the cardinality of S is the index [G : H], and the same argument as 
in the proof of the First Theorem on Finite Automorphism Groups 
yields the inequality ffS < [L:K], Now, using the degree and index 
formulas, one obtains 

[M:L} = [M:K}/[L:K] = #G/[L:K] < #G/[G:tf] = #H. 

Thus we have 

Ayr < AgoT, 

which together with the second degree formula ffH = [M : $(#)], i.e. 



A g — AyrO $ 

provides all hypotheses needed in order to apply the general theorem 
on (Right) Perfect Galois Connections. 
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3.2 Garrett Birkhoff’s Polarities 

As mentioned earlier, many Galois connections (among them the clas- 
sical one between subfields and subgroups) arise from a relation R 
between a set A and a set B of functions whose domain includes A, 
so that R is defined in terms of a certain equation involving members 
of A and B. 

Apparently, Garrett Birkhoff was the first to point out the fundamen- 
tal construction of Galois connections, termed polarities , from arbi- 
trary relations between two sets. The initial publication containing a 
systematic treatment of these ideas is the 1940 edition of Birkhoff’s pi- 
oneering Lattice Theory [3], a textbook that today is still a rich source 
of inspiring concepts, facts and methods in the theory of lattices and 
ordered sets. 

Let us recall Birkhoff’s construction, now well-known and applied 
in hundreds of mathematical and extra-mathematical problems, and 
sketched briefly in Section 1.4. It starts with a relation between (the 
elements of) two sets A and B, that is, with a subset R of the carte- 
sian product Ax B. Naturally associated with such a relation are two 
canonical maps, = R~ * and = *~R, between the power sets: 

$ : V{A) ->■ V(B), X ^ X R = {y e B : xRy for all x <= X} , 

: V(B) — »• V(A), f 4 Y R — {x E A : xRy for all y € Y}. 

While Birkhoff writes X* for X R and Y t for Y R , Ganter and Wille 
[59, 153] prefer to use the same symbolic notation X' and Y' for these 
sets. Since often several relations have to be considered simultane- 
ously, we have chosen a notation that indicates the dependence of 
the relation in question. Furthermore, as the relations need not be 
symmetric, we distinguish between different positions. 

As Birkhoff remarks next, the maps $ and 4/ invert inclusion, and 
their composites are extensive, that is, 

ICW = X r r and Y C W* = Y R R . 

Because of the immediate equations 

$o$of = $ and o o <f> = $, 

it is now clear that the composite maps 

W o $ : V(A) V(A), X ^ and 
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$ o : P(£) ->■ P(5), Y ^ Y t* 

are closure operators, and that $ and \& induce mutually inverse dual 
isomorphisms between the associated closure systems 

Cd> = {$(X) = X* : X C A} = {Y C B :Y = yt*}, 

which is the range of $ as well as of $ o , and 

Css = {tf (y) = y f : Y C B} = {X C A : X = A**}, 

which is the range of ^ and of \k o <f>. In particular, for X C one 
has 

(V X)* = f){X*:X £ X}, (f| *)* = V{^* : X £ X}, 
and analogously for subsets of 

Birkhoff mentions the following four “classical” examples: 

• A is a group of automorphisms of a field and aRy means a(y) = y 
(leading to the Galois connection between subgroups and subfields), 

• A is a set of functions on the space R", and ipRy means ip(y) = 0 
(leading to connections between varieties and Zariski topologies etc.), 

• A = B is a ring, and xRy means xy = 0 

(leading to the connection between left and right annihilator ideals), 

• A = B is a partially (or quasi-) ordered set with order relation R 
(leading to the completion via Dedekind-MacNeille cuts). 

The term “polarity” is borrowed from the following examples of sym- 
metric relations (where A coincides with B and R with the dual rela- 
tion): if A = B is a finite-dimensional euclidean space with scalar 
product ( , ) then, for the orthogonality relation x _L y given by 
(x, y) = 0, the map $ = associates with any subset X its orthog- 
onal space X 1 , and it induces a dual automorphism on the lattice of 
all subspaces. In the case of (infinite-dimensional) Hilbert spaces, one 
still obtains a dual automorphism, but only on the lattice of all closed 
subspaces. 

Two further examples mentioned in [3] are the symmetric relation S on 
a group given by xSy xy = yx, for which $ = carries each sub- 
group to its centralizer, and the symmetric inequality relation, which 
leads to the dual automorphism of power sets via complementation. 

As observed by Everett [54] and mentioned already in Section 1.4, not 
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only does every relation between two sets (which may be identical) give 
rise to a Galois connection between the corresponding power sets, but 
also conversely, every Galois connection between power sets arises in 
that fashion from a unique relation: indeed, if $ : V(A) —>V(B) and 

: V(B) —tV{A) are two maps satisfying 

X c (y) & Y c $(X) (X CA,Y CB) 

then there is a unique relation R C Ax B with $ = R~* and Ik = ^R, 
viz. 

xRy x G \&({y}) 43- y E <f>({a;}). 

Of course, this is the only possible way to define R so that $ = i?" 4 , 
and if one defines R by that equivalence then one actually obtains 

R~*(X) = {y : xRy for all x £ X} = 

{v e B : X C *©})} = 
and analogously *~R(Y) = \k(F). 

Many interesting isomorphism theorems of order theory, algebra and 
topology later turned out to be induced by a rather simple polarity 
- and indeed, the general theory says that every dual isomorphism 
between two closure systems (respectively, every isomorphism between 
a closure system and a kernel system) arises in that fashion from a 
relation. 

The example of an order relation and the associated polarity deserves 
particular attention, also from the historical point of view. As ob- 
served by MacNeille [91] in 1937, a few years before the publication 
of Birkhoff’s Lattice Theory , Dedekind’s famous construction of the 
reals from the rationals by so-called cuts [27], published first in 1872, 
extends to arbitrary posets, providing a universal completion. There 
are at least two alternative ways of defining cuts in arbitrary posets: 
in MacNeille’s original article, cuts are pairs ( X , Y) with X = F< (the 
set of all lower bounds of Y) and Y = X- (the set of all upper bounds 
of X). This approach follows the idea of Dedekind and was generalized 
later in formal concept analysis, where the notion of cuts was extended 
to that of concepts (see Section 1.5). The obvious up-down duality 
of that “two-sided” definition has certain conceptual advantages but 
is technically less convenient than the second approach, preferred by 
Birkhoff [3], where the first components T< are regarded as (lower) 
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cuts; Birkhoff also speaks of “closed ideals ”, reminding of the fact 
that they are closed sets with respect to the involved Galois connec- 
tion. By definition, they are just the intersections of principal ideals. 
It is evident (and follows from the general remarks on polarities) that 
the lower cuts form a closure system and therefore a complete lattice, 
dually isomorphic to the closure system of upper cuts (intersections of 
principal filters). As Birkhoff points out, a poset is complete if and 
only if every cut is already a principal ideal, while in general, any 
poset P is embedded in the closure system MP of all cuts by sending 
each element to the principal ideal generated by that element: 

rj : P —>J\fP, x I— >■ — {p £ P : p < x} . 

This completion by cuts, alias Dedekind-MacNeille completion or nor- 
mal completion , may be characterized abstractly by various universal 
properties (see [ 2 , 10, 15, 40, 10, 45, 46, 124] for details). 

Characterizations of the Completion by Cuts 

For an embedding s of a poset P in a complete lattice C , the following 
conditions are equivalent: 

(a) There is an isomorphism l : NP —> C with e = i o 77 . 

(b) e(P) is join- and meet-dense in C. 

(c) e preserves all existing joins, and e(P) is join-dense in C. 

(d) For every order embedding s' of P in a complete lattice C' , there 
is an order embedding t : C —> C' with s' = 1 o e. 

(e) For every join-dense embedding s' of P in a complete lattice C' , 
there is a unique join-dense embedding l : C — >• C' with e' = ioe. 

(f) For every cut continuous map e' from P into a complete lattice 
C' , there is a unique join-preserving map t : C — >■ C' with e' = 

LOS. 

(g) For every cut stable map s' from P into a complete lattice C' , 
there is a unique join- and meet- preserving map 1 : C —¥ C' with 

s' = L O S. 

A map ip : P — >• Q between posets is (lower) cut continuous if inverse 
images of cuts are cuts (i.e. closed ideals). Every residuated map is cut 
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continuous, every cut continuous map preserves joins, and on complete 
lattices, all three types of maps coincide. Furthermore, ip : P Q is 
cut stable (see [45]) iff 

p(X^)< = p(X^<) and p{X<)^ = p(X^) for all XCP. 

Any such map is lower and upper cut continuous, but not conversely. 

Precisely speaking, Dedekind’s completion of the rationals gives a con- 
ditionally complete lattice (in which all nonempty bounded subsets 
have join and meet); in the general setting of posets, that construc- 
tion may be imitated by taking the nonempty proper cuts only. Here, 
Birkhoff proves the following general result: 

Conditional Completion of Bidirected Posets 

If P is an up- and down-directed poset (all finite subsets have upper 
and lower bounds ) then the nonempty proper cuts form “the” condi- 
tional completion, that is, a conditionally complete lattice in which P 
is join- and meet-densely embedded (which forces uniqueness of the 
conditional completion up to isomorphism). 

Birkhoff’s contributions to the lattice-theoretical development of ge- 
ometry are immense and cannot be reported here (see [14] for one of 
his own late reviews of the research in that field). Since (synthetic) 
geometry is based on the idea of incidence relations, it is plain that 
Galois connections and closure systems or operations play a crucial 
role in that context. However, in order to avoid misunderstandings, it 
should be pointed out that for geometries of arbitrary dimension, it 
is not the incidence between points and lines (objects of dimension 1) 
that creates the desired closure systems of “flats”, but that between 
points and hyperplanes (objects of codimension 1). Indeed, calling 
copoints the completely meet-irreducible members of an arbitrary clo- 
sure system (e.g. the hyperplanes in the case of subspace systems), 
one has the fundamental 

Polarity Between Points and Copoints 

Let C be any algebraic closure system on a set A (that is, directed 
unions of closed sets are closed). Then the polarity associated with the 
incidence relation between points and copoints has C as the system of 
Galois-closed subsets of A, and the principal filters of closed sets are 
the Galois-closed subsets of C. 
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Of course, this result requires the Axiom of Choice in form of a max- 
imal principle, guaranteeing enough copoints in order to generate all 
closed sets via intersection. Abstractly, this is the famous Birkhoff- 
Frink Theorem, saying that in an algebraic lattice, every element is a 
meet of completely meet-irreducibles. Many important results of uni- 
versal algebra rely on that theorem (see e.g. [3] and [25]). 

3.3 Oystein Ore's Galois Connexions 

Oystein Ore was well acquainted with Dedekind’s style and methods, 
and consequently interested in many questions combining order and 
lattice theory with algebraic structures like groups and rings. Let us 
mention here only two typical examples of that fruitful combination: 
Ore’s beautiful theorem characterizing the locally cyclic groups by 
distributivity of their subgroup lattice [104] (for a short proof, see 
[126, p.12]), and Ore’s famous theorem on the exchange of elements 
in irredundant decompositions for elements in modular lattices (see 
[103], and [3, 25] for improvements), generalizing the Steinitz exchange 
theorem for bases of vector spaces. 

Probably the first source where the abstract order-theoretical notion 
of Galois connections emerged explicitly is Ore’s 1944 paper Galois 
Connexions [38]. As he remarks in the introduction, various causes 
urged him to postpone a book on the subject, and it remains unclear 
whether that book has ever been published in some form. Birkhoff 
included the general notion of Galois connections in the second edition 
of his Lattice Theory. 

Ore himself changed between the terms “Galois connexions” , “Galois 
connections”, and “Galois correspondences”, the latter two expres- 
sions being standard (and often synonymous) until today. Though 
Ore’s style is sometimes less precise than Dedekind’s and several of 
Ore’s notational idiosyncracies did not survive ( “connexion” for “con- 
nection”, “structure” for “lattice”, “dual topology” for “closure sys- 
tem”, “correspondence” for “map” etc.), his brilliant structural ideas, 
often based on some very simple but effective order-theoretical con- 
cepts, founded a very fruitful theory of (dual) adjunctions, extended 
later to the categorical concept of adjoint functors, monads and dual- 
ities, developed by the next generation of mathematicians (see [3]). 
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Like some of his mathematical ancestors and contemporaries, Ore used 
the set-theoretical symbol C (or, not always consistently, the symbol 
C) for the order relation; consequently, he interchanged the role of set- 
theoretical and lattice-theoretical symbols, as they are used in modern 
texts. In order to avoid ambiguities, we translate Ore’s symbols into 
the nowadays common ones (< for the order, V for the binary join, 
and A for the binary meet). 

As Ore explains in the introduction, his paper arose from a series of 
colloquium lectures on the theory of mathematical relations, given in 
1941 at the Summer Meeting of the American Mathematical Society 
at the University of Chicago. The mathematical part begins with a 
brief review of the order-theoretical concept of closure operations (Ore 
calls them “closure relations”) . As mentioned in Section 1.2, such 
closure operations x i — > x are characterized by the three properties 
of idempotency (x = x), extensivity (x < x) and isotonicity (x < 
y =>- x <y). Ore cites his own papers [107, 108] and Ward [149] but 
not the work of Dedekind, who had observed (in a slightly different 
setting) that the three defining conditions for a closure operation may 
be combined to a single one, namely x < y x < y (see Section 
2.5). 

Ore defines isomorphisms between partially ordered sets P and Q in 
the usual way as bijections p : P — >■ Q such that 

x < y <p(x) < <p(y), 

and dual isomorphisms between P and Q as isomorphisms between P 
and the dual of Q. Among many possible alternative characterizations 
of such dual isomorphisms, the following two often occur in Ore’s 
paper: 

p = xV y (p{p) = p(x) A (p(y) 
and dually, 

p = x Ay (p(p) = (p{x) V p>{y) 

where p = x P y means that p is the supremum (join, least upper 
bound) of x and y, and dually, p = x Ay means that p is the inhmum 
(meet, greatest lower bound) of x and y. Note that the above equiv- 
alent definitions of dual isomorphisms work not only in lattices, but 
in arbitrary posets. Two posets are (dually) isomorphic if there exists 
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a (dual) isomorphism between them, and a (dual) automorphism is a 
(dual) isomorphism between a poset and itself; finally, a self-inverse 
dual automorphism is called an involution. 

As a first theorem, Ore states the one-to-one correspondence between 
closure operations and meet-closed subsets of complete lattices, ob- 
tained by associating with any closure operation its range (= fixed 
point set), and in the opposite direction with any meet-closed sub- 
set Pi the map that sends to each element the least member of Pi 
above it. A suitable extension of that correspondence to arbitrary 
partially ordered sets was discussed later by Robert Baer [7]; in that 
general context, meet-closed subsets have to be replaced by “partial 
ordinals” in Baer’s terminology ( closure ranges in the sense of Section 
1.2). Recall that we have reserved the term closure systems for the 
set-theoretical version, i.e. for systems closed under arbitrary intersec- 
tions, while by a closure range in a poset P we mean a subset Pi such 
that for each p G P there is a least pi G P\ dominating p. Ore con- 
firms that the closure operations of a complete lattice form a complete 
lattice that is dually isomorphic, under the above correspondence, to 
the closure system of all closure ranges. 

After the preparatory remarks on closure operations, he introduces 
his fundamental “Galois connexions” or “Galois correspondences” as 
pairs of antitone (i.e. order-reversing) maps p : P — >• Q and ip : Q — » P 
whose composites are extensive, that is, 

p<ipop(p) for p G P and q<poip{q) for q G Q. 

Ore does not mention the equivalent characterization by the condition 



P < t%) q < 

but immediately notes the identities ipopo ip — ip and poipop = p, 
from which he deduces that p o ip and ip o p are closure operations 
whose ranges are dually isomorphic. Everybody using Galois connec- 
tions is aware of these basic and useful facts. The converse statement 
however, also due to Ore, seems to be less known: namely, that given 
two complete lattices P and Q with meet-closed subsets P\ and Q i, 
respectively, any dual isomorphism between Pi and Q i is induced by 
a unique Galois connection between P and Q. The partners of that 
connection are obtained simply by composing the involved closure op- 
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erations with the appropriate dual isomorphisms. Again, that result 
extends to arbitrary posets, when meet-closed subsets are replaced 
with closure ranges (see Section 1.2). More precisely, ordering the 
Galois connections between posets P and Q “pointwise” by 

{<p, $) < W, V’O & (<p(p) < <p'(p)) ^ Vq zQ < 

one obtains a poset P eg) Q ~ Q <g> P with the following property: 

Theorem on the Ranges of Galois Connections 

Via restriction to the ranges, the poset P®Q of all Galois connections 
between P and Q is isomorphic to the poset of all dual isomorphisms 
between closure ranges of P and of Q, respectively. If P and Q are 
complete lattices then so is P<S> Q. 

Ore defines “one-sided perfectness” of a Galois connection by requir- 
ing that one of the partners be onto, and he observes that this is 
equivalent to the postulate that the other partner be one-to-one. His 
definition of “perfect Galois connections” is the usual one, requiring 
injectivity (hence also surjectivity) of both partners. Thus, perfect 
Galois connections consist of two mutually inverse isomorphisms (see 
our introductory Section 1.3). 

In the third section of his paper on Galois connections, Ore indicates 
an “interpretation of the theory in terms of continuous maps”, but he 
does not say precisely what he means by an “interpretation”. Also, 
his claim that 

any Galois connection between [complete lattices] P and 
Q corresponds to a mapping of a space with a structure 
of closed sets isomorphic to P onto a dense subspace of a 
space whose structure of closed sets is isomorphic to the 
dual structure Q* of Q 

fails, no matter how the word “corresponds” is interpreted: for the 
unique Galois connection between a one-element and a two-element 
chain, there is no mapping between spaces with the claimed proper- 
ties. The whole section about continuous mappings is a bit vague and 
becomes more transparent in terms of adjunctions rather than Galois 
connections. Indeed, it is possible to describe exactly the passage be- 
tween adjoint maps and continuous maps (see e.g. [41] - [12]). Here, 
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as often, it proves advantageous to invoke the machinery of functo- 
rial equivalences and adjunctions, the modern categorical extension 
of order-theoretical adjunctions (see e.g. [3]). For the moment, the 
following hint may suffice: every complete lattice may be regarded as 
a closure space, by taking the principal ideals as closed sets; in that 
setting, the residuated, i.e. left adjoint maps are precisely the contin- 
uous ones, and for an arbitrary closure space S with closure system 
C, the map 

r]s : S — »■ C, xv-t {x} 

is continuous. Moreover, one has the following connection between 

Continuous Maps and Adjoint Maps 

Let C and C be closure systems on sets S and S', respectively. Then 
an arbitrary map <p : S — » S' is continuous if and only if there is a 
(unique) left adjoint, i.e. join-preserving map $ : C— >■ C' with 

$ o r]s = rjs' o g> . 

Hence, the category of complete lattices and residuated maps is reflec- 
tive in the category of closure spaces, with reflection maps rjs . 

Of course, the “lifted” map $ sends any closed set to the closure of its 
image, and the right adjoint of $ sends any closed set to its inverse 
image. In other words, this adjunction is induced by the axiality 
associated with the continuous map <p. 

Let us mention only a few ideas in that area due to Ore. One of his 
aphoristic statements is that 

every complete structure [lattice] P can be considered to be 
the structure of closed sets for a suitable topological space. 

In order to understand that sentence, one must know that Ore means 
by a topological space a set S equipped with a closure operator ( “clo- 
sure relation”) making the empty set closed, while finite unions of 
closed sets need not be closed for him. Of course, such a “topological 
representation” of a complete lattice P is easily achieved, by endowing 
the underlying set Po = P\ {0} (where 0 = f\P = \J $ is the least el- 
ement) with the closure operator X i-» ]0, \f X], so that the “deleted” 
principal ideals ]0,p] = {x G P : 0 < x < p} form a closure system 
isomorphic to P, and the empty set is closed. Now, Ore claims that 
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any substructure Pi of P with respect to crosscut [i. e. any 
meet-closed, subset] corresponds to a mapping of this space 
onto a space whose structure of closed sets is isomorphic 
to P\. 

Obviously, the mapping he has in mind is the closure operation as- 
sociated with Pi, restricted to Po- This is in fact a continuous map 
(between closure spaces), since the preimage of the deleted principal 
ideal generated by pi in P\ is the deleted principal ideal generated by 
Pi in P. For some more topological constructions involving adjunc- 
tions, see the end of Section 3.4. 

By a Galois connection within a complete structure , Ore means a Ga- 
lois connection between a complete lattice P and P itself. By the 
Theorem on Ranges of Galois Connections, those Galois connections 
form a complete lattice isomorphic (via restriction) to the complete 
lattice of all dual isomorphisms between pairs of meet-closed subsets of 
P. Under that isomorphism, the dual automorphisms of meet-closed 
subsets correspond to the Galois connections within P whose partners 
have the same range. In particular, the involutions of meet-closed 
subsets correspond to those Galois connections whose partners are 
identical. 

Deviating from Birkhoff’s terminology, but in accordance with other 
authors, Ore means by a polarity or orthogonality ( alias orthocomple- 
mentation) an involution under which the image is always complemen- 
tary to its preimage. (Ore speaks of “the” complement, but we know 
from the theory of vector spaces that complements are not necessarily 
unique, even if such an involution exists; also, instead of Ore’s nota- 
tion a* for the orthocomplement, conflicting with his notation R* for 
the dual relation and P* for the dual poset, most authors use the sug- 
gestive symbol aP) Then, Ore says that “because of the importance of 
the applications such polarities are of particular interest ” but gives no 
such examples (in this paper). Of course, interesting applications are 
to be found in the theory of Boolean lattices and, more generally, of 
orthomodular lattices, in particular in the theory of Hilbert spaces and 
their closed subspace lattices [3, 16, 145]. As observed in [34], rather 
mild hypotheses force an antitone self-map p of a bounded lattice L 
to be an orthogonality. Indeed, if 

p(x) < x =>• x = 1 and p(x) = 1 => x = 0 for all x E L 
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and cp is antitone then <p(x) must be a complement of x: 

j := x V p(x) >x=> p(j) < tp(x) <j => j = 1 , 

m := xA<p(x) < <p(x) < <p(m) =>- <p(m) = mV^(m) = 1 =>• m = 0. 

If, moreover, is self-inverse then it is an orthogonality. 

Next, Ore briefly considers Galois connections between a complete lat- 
tice P and its dual, thereby anticipating the concept of adjunctions 
( Galois connections of mixed type in the sense of Benado [11] and 
Schmidt [123]). Ore observes (without proof) that any complete ho- 
momorphism a : P — >■ P of a complete lattice P gives rise to a Galois 
connection (<p, ip) between P and its dual with 

ip(x) = maxjy : a(x) = a(y )} and ip(x) = min{y : a(x) = My)}- 
In Section 5, Ore reports and comments on his own and. Birkhoff’s 

results concerning the passage between relations and Galois connec- 
tions. His claim that the theory of Galois connections be “expressible 
in terms of relations” remains somewhat unclear. What he probably 
means (but what he does not state explicitly) is the following 

Representation of Galois Connections by Relations 

Associating with an arbitrary Galois connection (<p, ip) between posets 
P and Q the relation R C PxQ with 

xRy x < ip(y) y<p(x), 

one obtains an isomorphism between the poset P ® Q of all Galois 
connections and the poset of all G-ideals in PxQ, i.e. relations R 
such that for each x G P, the set {x} R = {y £ Q : xRy} is a principal 
ideal of Q, and for each y E Q, the set {v}r = {x G P : xRy} is a 
principal ideal of P. 

The following diagram (where rjp and tjq denote the principal ideal 
embeddings) commutes: 
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Hence, every Galois connection is induced, up to isomorphism, by a 
polarity. 

Later authors called P ®Q the tensor product of P and Q, though it 
shares the desirable properties of usual tensor products only if P and 
Q are complete lattices. For example, the “associative law” 

P®{Q®R)~(P®Q)®R 

valid for complete lattices, fails for arbitrary posets. The structure 
of such tensor products has been investigated intensively by various 
authors (see, for example, [89, 99, 133]). Remarkable is that the tensor 
product of lattices need not be a lattice [77], and the tensor product 
of distributive complete lattices need not be distributive, while the 
tensor product of completely distributive lattices is always completely 
distributive (see Raney [118] and Shmuely [134]). 

As shown in [43], most of the deficiencies occurring in the theory of 
tensor products for posets are remedied by replacing G-ideals with the 
more general “multicuts” T C PxQ, defined by the condition 

XxYCT => X^< x Y-< C T . 

This definition also opens the possibility of defining tensor products 
with an arbitrary number of factors (see [43] for details). A differ- 
ent tensor product for complete lattices plays an important role in 
formal concept analysis [59] and lattice-theoretical topology [49, 53]. 
For completely distributive lattices (but not for arbitrary distributive 
complete lattices) it coincides with that defined by Galois connections. 

Interestingly, Ore does not consider (binary) relations as the funda- 
mental notion and functions as a derived special kind of relations, but 
conversely, he defines relations as functions R : B — » V(A) from a set 
into a power set and then declares the expression xRy (not yRx !) by 
x e R(y) . Of course, both approaches lead to the same conclusions. 

Next, Ore states and proves Everett’s theorem saying that every Galois 
connection between power sets comes from a unique relation between 
the underlying sets [54]. In slightly extended and supplemented form, 
that theorem may be rephrased as follows: 

Galois Connections Between Power Sets 

For any two sets A and B, the following three collections form mutu- 
ally isomorphic complete lattices: 
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(a) the power set V(A x B), 

(b) the set of all Galois connections between V(A) and V(B), 

(c) the set of all closure dualities, i.e. dual isomorphisms between 
pairs of closure systems on A and B, respectively. 

As mentioned earlier, an isomorphism between (a) and (b) is given by 
associating with every relation R C A x B the pair (R~^,^R), and 
an isomorphism between (b) and (c) is obtained by restriction to the 
ranges. 

Since Ore means by a “dual topology” merely a closure system contain- 
ing the empty set as a member, he has to insert minor modifications 
in order to describe dual pairs of “topologies” by means of relations, 
namely by those relations which have the property that no y E B 
satisfies A = R(y), and no x £ A satisfies Y = R*(x). By reasons of 
analogy, we call a relation with that property unbounded, because in 
the case of order relations, it just excludes the existence of greatest 
and least elements. Let us summarize some of Ore’s results about 
specific relations and Galois connections. 

Correspondence Between Relations and Closure Dualities 

Under the bijection between relations and Galois connections, 

(1) the complements of bijections correspond to perfect Galois con- 
nections between power sets, 

(2) the unbounded relations correspond to dual isomorphisms be- 
tween closure systems with closed empty set, 

(3) the symmetric relations correspond to the involutions of closure 
systems, 

(4) the symmetric anti-reflexive relations (graphs) correspond to or- 
thogonalities of closure systems. 

Ore points out (without formulating explicitly the isomorphism) that 
(4) “gives an actual method for the construction of all complete struc- 
tures which have a polarity ”. 

Concerning (2), it should be mentioned that if one wants to charac- 
terize dualities between topological closure systems in the usual sense 
(closed under finite unions), one must restrict the representing rela- 
tions by the additional condition that all sets of the form R(b\) URfo) 
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and R*(ai) U R*(a 2 ) be closed, which explicitly means that if neither 
cRbi nor cRb 2 then there is a b with xRb whenever xRbi or xRb 2 , 
but not cRb, and dually. R.-E. Hoffmann [75] has observed that a 
topology (in the usual sense) has a “dual”, i.e. , is dually (lattice) iso- 
morphic to another topology, iff it has a base of strongly connected 
open sets, where a nonempty subset U is strongly connected (Levine 
[88]) iff for all open V, W, the inclusion U C V U W entails U C V or 
U C W. Thus, under the isomorphism between relations and closure 
dualities, the above type of relations are in one-to-one correspondence 
with dual isomorphisms between topologies (having a base of strongly 
connected sets). 

In the final section, Ore touches on a concept he calls Galois theory for 
relations. Today, one is familiar with Galois theories for any sort of au- 
tomorphisms. In the specific setting of a relation R on a set S, an auto- 
morphism of (S, R) is a bijection a : S — >• S with x Ry <£>• a(x)Ra(y), 
thus generalizing the notion of automorphisms for order relations. As 
in all similar situations, the automorphisms for R form a subgroup 
Gr = G(S, R) of the full permutation group on S. Again, the fix 
relation induces a Galois connection ($, \f r ), where <f>(A) is the set of 
all elements fixed by each member of X C Gr, and \k(T) is the set of 
all automorphisms fixing each member of Y C S. 

Ore focusses on the specific case of an equivalence relation E and the 
associated partition P E . Here, G E is a direct product of groups G EjK , 
consisting of automorphisms of the union S E>K of all blocks of size k 
( “complete monomial groups” in Ore’s terminology [105, 106]). Then 
he describes the Galois-closed subsets of S. 

The paper concludes with some remarks on general Galois theory, 
starting with some group G of permutations of a set S and making use 
of the polarity associated with the fix relation. Ore first considers the 
case where G acts transitively on S. Here, the Galois-closed subgroups 
of G are the intersections of conjugates of one fixed stabilizer. The 
general case is then settled by partitioning the set S into its orbits 
under the action of G. 

Later generations have developed a broad spectrum of instances of 
polarities or more general Galois connections for groups, rings, alge- 
bras, topological spaces, and many other structures. But Ore’s basic 
concept of Galois connections remained unchanged until today. 
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3.4 Adjunctions as Galois Connections of Mixed 
Type 

During the forties and fifties, many pieces of the Galois connection 
puzzle have been collected together by various authors, not all of which 
can be cited here. But some of these contributions deserve explicit 
mention. 

Everett’s early study from 1944 (see [54]) has already been discussed 
in connection with Birkhoff’s and Ore’s work. 

In 1948, G. Nobeling started one of the first systematic attempts to 
develop a “pointfree” topology by means of lattice theory, initiated in 
the thirties by Menger [97] and Ore [38], and culminating in the 1954 
monograph Grundlagen der Analytischen Topologie [101]. His earlier 
note [100] on that subject is one of the first sources where adjoint pairs 
of maps (rather than Galois connections) between complete lattices 
occur explicitly: Nobeling considers “regular homomorphisms” , i.e. 
maps (p : P — > Q preserving arbitrary joins, and their “ inversions ”, 
i.e. right or upper adjoints 'ip = ip A , which preserve arbitrary meets 
and are given by 

¥> A (?) = V(P € P : (f(p) < q}. 

The original map is then determined as the left or lower adjoint xp v of 
ip by the dual equation 

i> y (p) = A{? e Q : ^(?) > p}- 

As mentioned in the first chapter, the “covariant” counterpart of Ore’s 
system of axioms for Galois connections, obtained by reversing one of 
the partial orders (but not both!) leads to the definition of adjoint 
pairs of maps between arbitrary posets. Another early explicit occur- 
rence of such a system is found in Benado’s 1949 paper [11]: 

V < P 1 => A(p) < A(p'), 
q < q' => p{q) < e{q '), 
p < q(Kp)), x {q{q)) < Q- 

Benado speaks in his “intercalation theory” for functions of “con- 
nexions monotone d’espece mixte” (“monotone connections of mixed 
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type”). 

Also of relevance for the development of an abstract adjunction theory 
is Pickert’s note Bemerkungen uber Galois-Verbindungen [114] from 
1952. Being an expert in geometry and its foundations, Pickert was 
interested, like Birkhoff, Menger and others, in the lattice-theoretical 
background of geometrical constructions. Maybe his note [114] is the 
first one where it is stated explicity that a Galois connection between 
partially ordered sets is already determined by each of its partners, 
and that a map between complete lattices is the partner of a Galois 
connection if and only if it transforms joins into meets: 

ip(yx) = MX). 

(We know that the “only if” part holds for arbitrary posets and all 
existing joins.) Moreover, Pickert shows that a map 

p : P — y Q } x i — y x* 

between arbitrary posets is the partner of a Galois connection iff 
it is antitone, i.e. x x < x 2 =>- x x * > x 2 * , and each of the sets 
{x £ P : x* > y} with y £ Q has a join yt with image yt* > y. 
If that is the case then yt is even the greatest element of the set 
{x £ P : x* > y} and the image of y under the second map of 
the Galois connection. After those general remarks, Pickert turns to 
two specific situations, one motivated by projective spaces, and the 
other dealing with the fundamental possibilities of defining topologi- 
cal spaces. Similarly as in Ore’s considerations on continuous maps, 
these examples lead to adjunctions (hence to axialities ) rather than 
Galois connections (polarities) between power sets. Pickert makes use 
of the fact, obtained immediately by inverting the inclusion order of 
one power set, that the left (respectively, right) adjoint maps between 
power sets are just those which preserve arbitrary unions (respectively, 
intersections) . 

Pickert’s geometrically inspired construction begins with two sets M, 
I and the set M 1 of all functions from / into M (here, I is regarded as 
an index set and not an incidence relation, so confusion with the intent 
M 1 of formal concept analysis is unlikely) . In the specific case where 
M is the set of all points in a projective space, I = {1, 2, 3}, and C C 
M 1 is the set of all collinear triples, the lines are just the maximal sets 
X C M with X 1 C C. Generally, setting y* = {X C M : X 1 C y} 
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for any subset T of M 1 , one obtains a map 3^ i — >• 3^* from V(M I ) into 
the “double power set” VVM that preserves arbitrary intersections. 
Hence, it is one partner of a Galois connection between the dual of 
V(M I ) and VVM or, otherwise stated, the right adjoint of a map 
in the opposite direction, sending any system X C VM to the union 
X t = U(^ J : A" £ X}. Apparently, this adjunction is an axiality and 
comes from the relation R between VM and M 1 defined by 

XRy y E X 1 . 

The second topic concerns the definition of topological spaces by cer- 
tain neighborhood systems on a set M. These are families y = (To : 
a E M) of subsets oiVM satisfying the following axioms: 

(Nl) T e To => a eY 

(N2) Y E To and Y C Z C M => Z E To 

(N3) Y E To =» 3X e To(H G fli^x :xeX}) 

and they are topological neighborhood systems if in addition 

(N4) Y,zey a => r n z g To- 

Pickert shows that the known one-to-one correspondence between such 
topological neighborhood systems and topologies (due to Hausdorff) 
comes from a suitable adjunction: 

Theorem on Neighborhood Systems and Topologies 

For any set M , the map 

A : VVM ^(VVM) M , X <->■ X* = ({T : 3AgT (a GlCh)} : 
a EM) 

preserves arbitrary unions and is therefore a left adjoint. The Galois- 
closed subsets of VM are precisely the kernel systems on M , and the 
Galois- open members of (VVM) M are precisely the neighborhood sys- 
tems. Under the induced isomorphism , the topological neighborhood 
systems correspond to the topologies on M . 

We omit the proof details but add the remark that the above ad- 
junction may be regarded as an axiality: just replace any family 
T = (To : a £ M) by the relation {(Y,a) : Y E Ta> a € M}. This 
yields an isomorphism between (VVM) M and V(VM x M). Now, the 
relation N C VM x (VM x M) defined by 
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X N (y, a) 4* aeX CY 

gives rise to an axiality between VVM and V(VM x M) with 

-+N(X) = {(!» G VM x M : 3X G X(a GIC7)} and 

N<~(y) = {X EVM :aeX CY^(Y,a) ey}. 

In case X is a topology, the first equation just amounts to the usual 
definition of neighborhoods, while the second leads to the definition 
of open sets as neighborhoods of their points, interpreting (Y, a) G y 
as “Y is a neighborhood of a” . 

A rather systematical investigation of Galois connections ( “Galois- 
Korrespondenzen vom Hiillentypus ”) and of adjunctions ( “Galois-Kor- 
respondenzen vom gemischten Typus”) was carried through by Jurgen 
Schmidt in his 1953 paper Beitrage zur Filtertheorie II [123]. This pa- 
per also contains applications to various set-theoretical constructions 
involving filters, closure systems and topologies, and is partly based on 
his Ph. D. dissertation dated 1952 [122]. Schmidt had collected care- 
fully the material of other authors on that theme and cited, among 
others, Benado [11], Birkhoff [3], Everett [54], Pickert [114], Nobeling 
[100], Ore [38], and Riguet [119]. Schmidt formulates the whole the- 
ory in the set-theoretical language, with inclusion as order etc., but 
he emphasizes that all conclusions remain valid in complete lattices, 
and many of them even in arbitrary partially ordered sets. Schmidt’s 
style is sometimes quite narrative (and a bit long-winded), and the 
formal parts are often supplemented by philosophical and historical 
comments. Let us try to report some of his main ideas and results 
in condensed form. After an introduction to the theory of relations 
(pointing out similarities and differences to Ore’s theory), Schmidt re- 
calls some of the basic facts about Galois connections due to Ore and 
Pickert (in particular the fact that one partner is determined by the 
other) and adds the succinct description by the equivalence 

p < ip(q) O q < ip{p). 

Schmidt extends Ore’s characterization of Galois connections by means 
of dual isomorphisms between closure ranges to the setting of arbitrary 
ordered sets, as described in Section 1.3. On account of that result, 
Schmidt chooses the term “Galois correspondences of closure type”. 
For him (and others), the main task in the investigation of concretely 
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given Galois connections is a “good” intrinsic characterization of the 
Galois-closed elements or sets, that is, of the two ranges associated 
with the closure operations obtained by composition of the partners 
of the connection. 

Next, he turns to the (covariant) adjunctions ( “Galois correspondences 
of mixed type”), for which he notes, besides Benado’s system of axioms 
[11], his own characterization 

(— ) (p(p) <q & p< if(q) 

which today is the usual one. Moreover, he characterizes the left 
(respectively, right) adjoint maps between complete lattices as those 
which preserve arbitrary joins (meets). For the partners of an ad- 
junction, he uses the names “Untere Finalkonjugierte” and “Obere 
Initialkonjugierte” . Apparently, these names allude to the special sit- 
uation of continuous maps, which induce an adjoint pair of maps be- 
tween the lattices of closed sets: the lower adjoint sends any closed 
set to the closure of the image, and the upper adjoint takes preimages 
of closed sets (see the previous section). 

Schmidt recalls the one-to-one correspondence between relations and 
polarities, due to Birkhoff, Ore and Everett. But he does not leave 
it at that but discusses systematically the covariant version nowadays 
called axialities (see Section 1.4 and [15]). Both the covariant and the 
contravariant form of adjunctions remain a guiding theme for Schmidt 
during a long period of his work (for an example of more categorical 
flavor, see [125]). 

The second part of the paper [123] focusses on the theme indicated 
in its title, namely applications of Galois connections to the theory of 
filters (and topological spaces). It appears adequate to mention here 
a few more examples from topology, partly contained in Schmidt’s 
paper, partly due to other authors (the true origins cannot always be 
localized; see also [15]). 

(1) Given a set X, define a relation R between A = V(X) and B = XxX 
by UR(y,x ) x G U implies y EU . 

The Galois-closed subsets of A with respect to the associated polarity 
are precisely the Alexandrojf topologies on X , i.e. those kernel systems 
which are at the same time closure systems. On the other hand, the 
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Galois-closed subsets of B are precisely the quasi-orders on X. Thus, 
one obtains in a straightforward way the known dual isomorphism 
between the lattice of Alexandroff topologies and that of quasi-orders 
on X (see [4]). 

(2) Let F(X) denote the set of all filters on X, and put A = V(X), 
B = F(J)xX. Similarly as before, consider the relation R C Ax B 
with 

U R (X, x ) 44- x G U implies U G T . 

This time, the Galois-closed subsets of A are just all topologies, while 
the Galois-closed subsets of B are the topological convergence rela- 
tions. This leads to the well-known equivalent description of topologi- 
cal spaces by open sets or by convergence (cf. [22, 114, 135]). Example 
(1) is obtained by restricting the codomain of the present relation to 
principal ultrafilters x = {T Cl : xEY}. 

(3) For the symmetric relation UAV U f) V 7^ 0 on a power set 
V(X), the associated polarity induces an involutorial dual automor- 
phism of the lattice S(.X’) of all stacks (subsets U oiV(X) such that 
U 6 U and U C V C X imply V € U). Under that involution, the 
lattice of filters is dually isomorphic to to the lattice of grills [22] (i.e. 
complements of set ideals), and the fixed points of that dual isomor- 
phism are the ultrafilters. 

(4) Now, take the complementary relation UXV U fl U = 0 on a 
power set V{X), or more generally, on a topology T C V(X). In that 
situation, the Galois-closed subsets of T turn out to be the principal 
ideals {U € T : U C V} generated by regular open sets V (i.e. sets 
that coincide with the interior of their closure). Thus, the lattice of 
Galois-closed sets is here isomorphic to the complete Boolean lattice 
of all regular open sets (cf. Glivenko [65], Birkhoff [3] and Frink [55]). 

(5) Finally, Schmidt constructs, via a suitable axiality, an isomorphism 
between the filter lattice and a certain topology on the set of all ultra- 
filters. But this construction is a special instance of the isomorphism 
between any spatial frame L, that is, a complete lattice in which every 
element is the meet of a subset of the set Sp(L) of all meet-primes, 
and its hull-kernel topology or spectral topology, consisting of all sets 

a = {p £ Sp(L ) : a ^ p} (a E L ) . 




94 



M. Erne 



Clearly, the isomorphism sends a to a, and its inverse sends any open 
set U to f\{L\U). If L happens to be a closure system on a set A (as 
in the case of the filter lattice F(X)), then this perfect adjunction is 
induced by the axiality corresponding to the relation 

R = {(x,P) e AxSp(L) :x(£P} 

(cf. the analogous construction of spectral topologies in Section 2.8). 

4 Residuation Theory 

Recall that a map between ordered sets is residuated if inverse images 
of principal ideals are again principal ideals, and that residuated maps 
are just the same thing as left adjoint maps, in other words, maps that 
have a right adjoint. A residuated groupoid is a set S equipped with a 
binary operation A : S' x 5 — >■ 5 and a partial order (sometimes only 
a quasi-order) such that each of the left translations 

x X: S ^ S, t/4 A (x,y) 

and each of the right translations 

A y : S — x S , x i — Y \{x, ?/) 

is residuated. More generally, a partially ordered (general) algebra is 
said to be residuated if all translations, obtained by fixing one coor- 
dinate in the basic operations, are residuated. 

In order to prevent misunderstandings, it should be pointed out that a 
binary operation A : PxP^Fon a poset is almost never residuated 
(when P x P is endowed with the componentwise order). For example, 
the binary meet operation A : L x L A I of a lattice L with more 
than one element does not even preserve binary joins, hence cannot 
be residuated: indeed, choosing a ^ b in L, one obtains 

A((a, b ) V ( b , a)) = A((aV b, 6V a)) = aVb aAb = ( aAb ) V (brta) = 

A (a, b) V A (6, a). 

Algebraic residuation theory originates from at least two sources: 

1. The investigation of the “laws of thought”, due to Boole and com- 
pleted by Schroder. Boole’s vague observation that the fundamental 
operations of joining and intersecting should be somehow ’’invertible” 
was made precise by Schroder who invented the concepts of “minimal 
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difference” and “maximal quotient” , which are close to the residuals 
for translations coming from the lattice operations of join and meet. 
The first section of this chapter is devoted to these ideas. 

2. The investigation of relationships between ideals of rings and frac- 
tional ideals of fields, as developed by Dedekind in his famous Sup- 
plement XI to Dirichlet’s Lectures on Number Theory [27]. It would 
require too much space to survey Dedekind’s ideal theory here. In- 
stead, we shall concentrate, in the second section of this chapter, on 
his basic (and more general) ideas concerning partially ordered groups. 

In the third section, we shall briefly review Krull’s pioneer work on 
abstract ideal theory. In the fourth section, we sketch the theory of 
residuated lattices, due to Ward and Dilworth. The fifth section con- 
tains a short survey of more recent aspects of residuation theory as 
presented in the monograph by Blyth and Janowitz [16]. Specializa- 
tion to locales , i.e. quantales whose multiplication is the binary meet, 
leads us to the current vital theme of pointfree ( but not pointless) 
topology. 

4.1 From Boole to Schroder: The Logical Calculus 

To ”/ag avro ap,av7r agxstv re nal fif] VTrdgxem 
advvarov 

toj avTui Kal Kara to avro ... 

Avrr] 8fj iracrujv earl (3sf3cau)TaTri tujv agx&v... It is 
impossible that the same quality should both belong and not 
belong to the same thing ... This is the most certain of all 
principles ... Aristotle, Metaphysica, III. 3. 

Quoted by George Boole in The Laws of Thought (1854) 

George Boole is known as the pioneer in the development of formal 
logic and its links to algebraic operations. Though the mathematical 
theory of Boolean algebras carries his name, he did not invent exactly 
that algebraic theory (see below) , and a great part of his work is of a 
more philosophical than a purely mathematical nature. In his essay 
from 1847, “The Mathematical Analysis of Logic” [17], and later in 
the famous monograph from 1854, “An Investigation of the Laws of 
Thought” [19], Boole developed a broad spectrum of ideas with great 
influence on later investigations of the mathematical foundations of 
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logical thinking, and so of lattice theory. It is not the place here 
to report Boole’s reflections in adequate length, but a few aspects 
pointing to certain rudiments of adjunctions should be mentioned in 
our historical expedition. For a comprehensive discussion of Boole’s 
theories of logic and probability, we recommend the monograph by T. 
Hailperin [69]. 

Boole’s style is a mixture of syntactic conclusions and semantic heuris- 
tics, and sometimes of a priori postulates with a posteriori justifi- 
cations of certain rules and methods by a wealth of examples. Let 
us try to abstract the main mathematical postulates and deductions 
from his work (without going into details of semantic interpretation) . 
Boole introduces two operations (connecting certain objects of think- 
ing): the “combination” of x and y, written simply as juxtaposition xy 
and describing the “class of things to which the names or descriptions 
represented by x and y are simultaneously applicable”] and the “ag- 
gregate” or “conjunction” of x and y, written additively as x + y and 
describing the class of all things that belong to x or to y (respectively, 
have one of the two properties described by x and y). Two aspects of 
Boole’s reasoning should be pointed out: first, for him, such “sums” 
are meaningful or “interpretable” only if the summands are disjoint, 
i.e., no object has the properties x and y in common. Thus, one is 
tempted to say that, in contrast to the total operation of “multipli- 
cation” xy, Boole’s “addition” x + y is only a partial operation. This 
already shows that a formalized theory comprehending Boole’s basic 
ideas is not Boolean algebra in our sense (where the addition is a to- 
tal operation, too). From a modern perspective, roughly speaking one 
might say Boole considered that fragment of Boolean algebra in which 
the join operation of Boolean lattices coincides with the addition of the 
associated Boolean rings, introduced and studied much later by M. H. 
Stone [137]. But probably this is not what Boole had in mind: he freely 
carried through calculations involving terms containing sums that vio- 
late the disjointness postulate, calling them “not interpretable”, until 
he arrived at a term that is “logically interpretable” . This is the most 
controversial part of his theory, appreciated by some commentators as 
“highly ingenious” , by others as “dark and mysterious” . 

The second, somewhat surprising aspect is that Boole paraphrases the 
addition both by the word “and” (applied to substantives, as in his 
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example “men and woman”), but also by the exclusive “or” (applied 
to adjectives, as in “beings that are men or women”). At first glance, 
this simultaneous use of the words “and, or” seems to conflict with 
our interpretation of the meet (intersection) described by the particle 
“and”, while join (union) is described by the particle “or”. The in- 
tuition that “and” is in some sense dual to “or” was made precise by 
Schroder, who distinguished between extent and intent of a concept 
(see Sections 1.5 and 2.6); here, the conjunction of attributes (form- 
ing the intent) corresponds to the intersection of the classes of objects 
having these attributes (the extent). As indicated in Section 1.5, this 
is a central aspect of formal concept analysis [59]. 

For the basic two operations, Boole explicitly postulates the law of 
commutativity, while the law of associativity is not mentioned at all 
(perhaps because juxtaposition and conjunction of more than two 
members usually are written or phrased without parentheses). As 
a fundamental law of logic, Boole emphasizes the law of idempotency 
xx = x, written as x 2 = x (and referred to as the “Law of Duality” by 
Boole himself ; however, that terminology is not very suggestive and 
was criticized by Schroder [129]). Thus, the Boolean multiplication 
(juxtaposition) obeys the identities valid in what nowadays is called 
a semilattice. For Boole ’s addition, however, the dual law x + x = x 
plays no role, in view of his hypothesis that the classes occurring in an 
“interpretable” sum have to be disjoint. The next fundamental axiom 
is the distributive law 

z(x + y) = zx + zy, 

which Boole motivates by examples. Then he introduces a third (par- 
tial) operation he denotes by the subtraction symbol From the 

point of adjunctions, this operation is of particular interest. Boole’s 
own comment: 

But the very idea of an operation affecting some positive 
change seems to suggest to us the idea of an opposite or 
negative operation, having the effect of undoing what the 
former one has done. 

Accordingly, Boole gives the interpretation “except” to the subtraction 
symbol ” . Shortly after, he states that from x = y + z one may 
deduce the equation x — z = y: again the idea of shifting parts of 
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an equation from one side to the other! One should remark that the 
equivalence 

(%) x — z = y ■*=> x — y + z 

holds only under the assumption that “z is subsumed under x ”, which 
may be described by any of the equivalent identities 

z = zx, z — zx = 0, z( 1 — x) = 0, 

using Boole’s symbols 0 for “Nothing” and 1 for “the Universe”, sub- 
ject to the rules 

Os = 0 and lx = x. 

But Boole prefers to choose an “indetermined class variable” v and 
expresses the above situation by the equation 



z = vx. 

Dealing with equalities exclusively, Boole does not mention the so 
defined order of subsumption, all the less the symbolic notation z < 
x. Nevertheless, the equivalence (%) may be regarded as a special 
instance of an adjunction, viz. for the equality relation as order. A bit 
problematic is the domain of definition for the operations involved in 
that adjunction. If one wishes to avoid partial operations, one has to 
consider, for fixed z, the mutually inverse isomorphisms 

x i — y x — z and y i-> y + z 

between the intervals 

[z, 1] = {x : z < x} and [0, 1 — z] = {y : y < 1 — z} . 

Of course, this is not only a (perfect) adjunction with respect to the 
identity relation = , but also with respect to the order relation <. 
An alternate (but not Boole’s) point of view is to regard y + z as the 
join and x — z as the relative complement xz' in a Boolean algebra B. 
Then the equivalence 

x — z < y 44 x < y + z 

shows that we have an adjoint pair of maps x x — z and y ^ y + z, 
defined on the whole of B, whose surjective restrictions yield the above 
mutually inverse isomorphisms. A third (and perhaps the most con- 
vincing) alternative would be to take for + the addition and for — the 
subtraction in a Boolean ring, in which case addition and subtraction 
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of a fixed element again are mutually inverse isomorphisms, this time 
defined on the whole ring. 

Boole claims that one must have not only the distributive law for the 
addition but also that for the subtraction, 

z(x — y) = zx — zy. 

And it is not difficult to observe that the equivalence (^=?) together 
with distributivity of the multiplication over addition formally (not 
only intuitively) entails distributivity over subtraction: under the as- 
sumption that y is subsumed under x, the equivalence (%) yields 

x = (x — y) + y and then z(x — y) + zy = z((x — y) + y) = zx, 

hence 

z(x — y) = zx — zy. 

A central point in Boole’s reasoning is the fundamental law of contra- 
diction, 

x{l — x) = 0 , 

for which he quotes the sentence by Aristotle at the beginning of this 
section, and which he deduces from the law of idempotency, written 
in the form x — x 2 = 0 or xl — xx = 0. The law of contradiction is 
then the starting point for Boole’s famous construction of the normal 
( “developed”) form of any Boolean expression, involving an arbitrary 
number of variables, composed by multiplication, addition, subtrac- 
tion and division. Explicitly, Boole notes the result for functions in 
one, two or three variables, in particular: 

f{x,y) = f{l,l)xy + f(l,0)x(l-y 

+ /(0, l){l-x)y + f{0,0){l-x){l-y). 

In the subsequent chapters, Boole makes extensive use of the Normal 
Form Theorem in his methods of reduction and solution of logical 
equations and the elimination of variables. Although these methods 
doubtless are ingenious and have kept their importance until today, 
a rigorous demonstration of their validity is often missing in Boole’s 
original work. 

Apparently, Ernst Schroder was the first to succeed in putting the 
whole theory, now called Boolean algebra, on an axiomatic base allow- 




100 



M. Erne 



ing for exact proofs and conclusions. In the preface to his early 1877 
treatise “Der Operationskreis des Logikkalkuls” [129] he writes (free 
English translation): 

The Boolean theory still suffers from certain incomplete- 
ness deficiencies. Among the lacks in Boole’s otherwise 
most admirable and attractive method I have remarked, 

I would like to mention that Boole includes an entirely 
alien element into his investigations ... the whole ballast 
of algebraic numbers. Invoking the latter indeed forces one 
to give up the interpretability of the intermediate steps of 
calculation, because one must compute with symbols like 
2, — l,|,jp which do not admit any logical interpretation. 

While it remains a hopeless undertaking to understand the 
meaning of each executed single operation, one is led, in a 
surprising but mentally not satisfactory way, to the desired 
and, indeed, in its own fashion correct result. 

In fact, Boole also considered rational functions containing quotients, 
which, superficially, do not admit any interpretation by logical vari- 
ables. For example, in order to solve the equation 

x = yz 

for z, he wrote down the developed form 

z = l = xy + jjz(l - y) + 0(1 - x)y + § (1 - x)(l - y) 

claiming that the quotients and ^ would have a “very important 
logical interpretation”, namely: 

... The symbol jj indicates that a perfectly indefinite portion 
of the class, i.e. some, none or all of its members have to 
be taken. Any other symbol [ than 0, 1 or jj] as coefficient 
indicates that the constituent to which it is prefixed must 
be equated to 0. It follows hence that if the solution of a 
problem, obtained by development, be of the form 

w = A + 0 B + [j C + ^ D , 

that solution may be resolved into the following equations, 
viz. 
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w = A + v C, D = 0. 

In the above example, this leads to the correct solution 

z = xy + w(l — x)(l — y) , x{\ — y) = 0 

with an indeterminate variable v. Of course, the simpler equation 
z = x + u(l — y) together with x(l — y) = 0 would work as well. 

In a metaphor, one might say that Boole virtuously juggled with the 
balls of logic, throwing them from one hand to the other but not caring 
much about what happened with them in the air, as long as he was 
able to have the desired result in hand at the end. On the contrary, 
Schroder carefully observed the balls during their flight through the 
air. 

In his “Operationskreis” [129] Schroder presents a semi- formal system 
for Boolean logic, consisting of the “commutation laws” and “asso- 
ciation laws” for the class operations of “sum” and “ product ” (inter- 
pretable either as logical or as set-theoretical operations), the laws of 
idempotency, one “distribution law” (the “obvious one” known from 
ring operations), and the rules aa' = 0 , a + o' = 1 for the existence 
of complements a' (Schroder writes oq instead of a' and calls it the 
negation of a). Concerning the four rules 

0 a = 0 , 1 ~ ~ u --- 1 

la = a , 0 + a = a 

Schroder carefully notes that the first two of them follow from the as- 
sociation laws, the idempotency laws and the complement laws, while 
he believes that the other two cannot be derived from the axioms, 
though being mutually equivalent, on account of the equations 

la = al = a(a + a') = aa + aa' = a + 0 = 0 + a. 

And he is right: taking for addition and multiplication the same semi- 
lattice operation, one gets 0 = 0a = 0-l-a = a, which is impossible if 
the semilattice has more than one element (but this argument is not 
Schroder’s). 

Besides many further equations valid in Boolean logic, he derives the 
absorption laws 

a + (ab) = a and a(a + b) = a 
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from the distribution law and the rules for the universal bound 1: 
a(a + b) = aa + ab = a + ab = al + ab = a(l + b) = al = a ; 
and now he concludes that the dual distribution law must hold, too: 
(a + b)(a + c) = aa + ab + ac + be = a + ac + be = a + be. 

After having proved and applied de Morgan’s laws 

(a V by = a' Ab 1 and ( a A b )' = a' V b' 

he states in conclusion that (in contrast to Boole’s original system) 
the whole calculus obeys the rule of dualization : every true formula 
remains valid when sum and product are exchanged at all occurrences 
in the formula. 

For the historical origins of residuation theory, the fourth section of 
the “Operationskreis” is of major relevance. Schroder points out that 
in the logical calculus with distribution and complements, the dual 
equations 

x + b = a and xb = a 

if solvable, do possess certain extremal solutions. Namely, x + b = a 
is solvable if and only if a'b = 0, in which case the largest solution 
is x = a, while the smallest solution is the “minimal difference ” x = 
a — b = ab' . Dually, the equation xb = a is solvable iff ab 1 = 0, and 
then the smallest solution is x — a, while the largest solution is here 
the “maximal quotient ” x = | = a + b' . 

Schroder derives a whole series of rules for these two “principal values 
of subtraction and division but he does not proceed explicitly to 
the final idea of residuation theory, namely to search for the extremal 
solutions of the inequalities x + b > a (which could be written as an 
equation, e.g. (x + b)a = a) and xb < a (equivalently, xb + a = a) 
which always exist and are given by x = ab', respectively, x = a + b'. 

In his later work, Schroder singled out the absorption laws as “partic- 
ularly characteristic for the logical calculus” and included them into 
the list of axioms but dropped the distribution and complement laws 
from the axiomatic framework of his “logical calculus”. He was familiar 
with the early (and fragmentary) set- and lattice-theoretical ideas of 
the Grassmann brothers [66, 67], who already used the symbols U and 
D and emphasized their dual behavior, in particular the equivalence 
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of the equations 

a n b = a and a U b = b , 

both expressing that a be “included” in or “subsumed” under b. In 
his extensive Lectures on the Algebra of Logic [131], Schroder often 
returned to that subsumption relation as a basic concept for logic (or, 
interpreted as set inclusion, for the theory of classes). Curiously, as 
a forerunner of the inclusion symbol C, he chose the Euro symbol =€ 
for subsumption. Comparing the two pioneers of lattice theory with 
each other, one might say that Dedekind preferred the operational ap- 
proach, while Schroder later emphasized the relational (hence order- 
theoretical) tools. His basic principles for the subsumption are reflex- 
ivity and transitivity; instead of demanding the law of antisymmetry, 
he defines equality by the equivalence 

a — b: a ^Lb and b =€ a 

(later a common use in set theory). The existence of a least element 0 
and a greatest element 1 are postulated verbally (rather than by the 
formulas 0 =€ a and a =€ 1). The definitions of (binary) join and meet 
are again divided into a formal part: 

if c =€! a and then c =€! ab, 

if a c and b =€ c then a + b =€ c, 

and a verbal part, describing ab and a + b in the usual naive set- 
theoretical sense of intersection and union by means of the word “Ge- 
biet” ( domain , field, area). The “logical calculus ” defined in that 
semi-axiomatic way apparently is what later would become the order- 
theoretical definition of (bounded) lattices. In order to obtain the full 
“identical calculus ” (of Boolean algebra), Schroder adds a weak form 
of the “distribution law”: 

if be = 0 then a(b + c) ab + ac 

and the defining condition for complements 

aa' =€ 0 and 1 =€ a + a' 

together with the postulate that for each a at least one such a' exists, 
“obtainable as remainder by omitting a from 1” . 

Schroder knew and appreciated Peirce’s attempts to formalize logi- 
cal thinking [110]. A purified list of order-theoretical postulates for 
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lattices (as formal systems of logic) was given later by Huntington 
[76], however only in connection with additional postulates defining 
Boolean algebras. One of these axioms is the following “syllogistic 
principle” used by Peirce [110]: 

if a =€ b' is false then there is an x ^ 0 with x =€ a and x^Lb, 

which together with the postulate bb' =€ 0 (and the meet properties of 
ab ) is easily seen to be equivalent to 
ab = 0 44- a^b 1 ab' = a, 

the later characterization of pseudocomplements in the sense of Glivenko 
[65] and Frink [55]. This is now a very interesting axiom, both in view 
of the historical controversy about Schroder’s distributivity postulate 
(provable or not?) and the early occurrence of an adjunction. In fact, 
the equivalences 

a&=€[ c" 4=> ba^i c" 4=> bad = 0 <=>■ b =€ (ad)' 

show that under the assumption c" = c (which follows from uniqueness 
of complements and was derived by Schroder from the distribution 
law), each of the unary meet operations 

A a : b !-»■ ab 

is left adjoint to the “relative pseudocomplementation map” 

(ad)'. 

And by the general fact that left adjoint maps preserve joins , this 
yields the distribution law 

a(b + c) = ab + ac 

and moreover its infinitary variant for all existing joins. The same clue 
may be used to give a very short proof of the famous Glivenko-Frink 
Theorem on the 

Booleanization of Pseudocompletemented Semilattices 

The skeleton 

S' = { a ' : a e S} = {a G S : a" = a} 

of a pseudocomplemented semilattice S is always a Boolean lattice 
(with o' as complement of a, ab as meet and and ( a'b ')' as join of 
a and b). 

(Compare the concise argument above with the rather complicated 
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proof of the distributive law in [68].) 

Using the aforementioned “syllogistic principle”, Peirce had deduced 
the distributive law; later, corresponding with Schroder, he admitted 
that the conclusion would be false, because of Schroder’s counterexam- 
ple (see Section 2.6). But later still, Peirce added a marginal remark 
saying his proof was not an error at all. Again half a century later, 
in a footnote of [3], Birkhoff stated that “Schroder corrected the er- 
roneous impression of Peirce that all lattices were distributive”. But, 
as Mehrtens [92] has pointed out, Peirce did not deal with any axiom 
system of general lattices but based his conclusion on the above “syllo- 
gism”. Nevertheless, Schroder certainly was right when he claimed to 
have shown that the distributive law does not follow from the “logical 
calculus” , that is, from the axioms of lattice theory. 

Let us summarize in a nutshell the role of adjunctions and Galois 
connections in the early lattice-theoretically flavored logical investiga- 
tions: 

Logical Rules Derived from Adjunctions and Galois Connec- 
tions 

( 1 ) Any partner xjj : x h ->- x' of a Galois connection on a lattice (e.g. 
any pseudocomplementation) converts joins to meets, hence satisfies 
de Morgan’s law 

(x V y)' = x' A y ' , 

while the dual law does not follow in general. 

(2) Any left adjoint (residuated) unary operation y i— >• ay obtained from 
some binary operation ( x , y ) h - > xy by fixing one argument preserves 
joins, hence satisfies the distributive law 

a(x V y) = ax V ay . 

4.2 Dedekind’s Invention of Lattice-Ordered Groups 

Although the name “lattice- ordered groups” or “l-groups” was not used 
by Dedekind, he invented that important modern concept and discov- 
ered its basic properties. Of the many interesting aspects of the later 
extensively elaborated theory of /-groups (see e.g. Birkhoff [3] for a 
small excerpt), we shall touch on those only which occur in Dedekind’s 
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work and provide the origin of the theory of residuated lattices. Al- 
ready in the first (1897) paper on his “Dualgruppen” (see [30] and 
Section 2.4), Dedekind defines abelian groups exactly the same way 
we do, but writes them always multiplicatively. Moreover, he consid- 
ers such groups endowed with an extra semilattice operation, denoted 
by + , and postulates the distributive law 

a(b + c) = ab + ac. 

In order to give a succinct version of Dedekind’s basic theorem on 
such structures, let us (historically) anticipate the notion of a partially 
ordered group, that is, a group G with a partial order such that left 
and right translations are order isomorphisms, hence left and right 
adjoint to the inverse translations: 

ax < y <=? x < a~ l y and xa < y 44- x < ya~ l . 

As in Section 2.5, we write V instead of + in order to avoid confusion 
with other possible additions. Dedekind’s crucial observation is that 

a A b = ab/ ( a V b) 

defines a second operation such that (G, V, A) becomes a lattice (“Du- 
algruppe”). His proof is entirely algebraic, but of course, one may 
also invoke the fact that inversion is a dual automorphism. Hence, 
x ax~ 1 b is a dual automorphism, too, and consequently a{a V b)~ l b 
must be the meet of aa~ 1 b — b and ab~ l b = a. Thus, in the non- 
commutative case, the above equation has to be replaced with 

a A b = a(a V b)~ 1 b 

(see Birkhoff [3]). Now, the non-commutative version of Dedekind’s 
theorem and an extension to infinite joins and meets reads as follows: 

Charcterization Theorem for Lattice-Ordered Groups 

For any partially ordered group G, the following statements and their 
duals are all equivalent: 

(a) The join a V e exists for the neutral element e and each a £ G. 

(b) G is a semilattice with (aV b)c = ac V be. 

(c) G is a semilattice such that c — A B implies 
aV c= f\ {aV b : b £B}. 

(d) G is a distributive lattice ( with the underlying order). 
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Proof, (a) => (b). Being order isomorphisms, the translations pre- 
serve joins. Thus, (( ab ~ x ) V e)b = abb -1 V eb = aV b exists, and (oV 
b)c = ac V be. 

(b) =/* (c). If c = f\ B and x < a V b for all b £ B, then c < b entails 
aV6 = aV6Vc and c < (a V c) A b = (a V c)(a V 6) _1 6 < (a V c)x _1 6, 
hence x(aVc) _1 c < b for all b € B. Consequently x(aVc) _1 c < c, and 
finally x < aV c. This proves the equation aV c=/\{aV b : b El?}. 

(c) =>- (d) => (a) and the self-duality of the statements are clear, be- 
cause inversion is a dual automorphism. □ 



The implication (b) => (d) for commutative groups and the follow- 
ing aesthetic proof is due to Dedekind. Iterated application of the 
lattice identities and the commutative and distributive laws for the 
multiplication yields 

(a V b V c)(ab V 6c V ca ) = (a V b)(b V c)(c V a), 



an identity Dedekind uses at some places in his general ideal theory. 
Now, 



a A (b V c) 



a(b V c) _ a(a6 V6cV ca) _ a ft ac 

a V 6 V c (a V 6) (a V c) aVb a V c 



(a A 6) V (a A c). 



In view of the preservation of all existing joins and meets by the unary 
lattice operations xi-AiVi and a;4flAi, one might guess that they 
could possess adjoints (as the multiplicative translations do). But 
that is not the case, by lack of universal bounds for any non-trivial 
lattice-ordered group. Again, it was already Dedekind who observed 
that such a group cannot possess any elements of finite order (see E. 
Noether’s comment in the 1931 edition of Dedekind’s Collected Works 
[31]). 



4.3 Wolfgang Krull’s Ideal Domains, and Quantales 

In the twenties of the past century, Wolfgang Krull created with in- 
genuity an abstract framework establishing the formal background for 
the ideal theory developed before by Dedekind, Noether and Artin, but 
also for many other mathematical structures (like locales or frames ) 
that became of interest only decades after Krull’s pioneering work. 
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The commutative case is treated by Krull in his 1924 paper [82], and 
the non-commutative case in his 1928 paper [83]. We shall discuss only 
a few basic definitions and facts from the first two chapters of the lat- 
ter, which extends much of the material contained in the former. The 
later chapters, being of minor relevance for the idea of residuation, 
will not be commented on here. Krull’s principal idea is the abstrac- 
tion from concrete ring- and ideal-theoretical situations to a structure 
whose “individuals” are not the elements of a ring, but the “ideals” 
(in a generalized sense) themselves. So he writes in the introduction: 

Wir legen nicht etwa einen nichtkommutativen Ring zu- 
grunde, aus dessen Elementen die in Betracht kommenden 
Ideale gebildet werden; wir gehen vielmehr von einem “Ide- 
albereich” aus, dessen Individuen die Ideale selbst darstellen, 
die filr uns - unabhangig von ihrer sonstigen Eigenart - 
allein durch die filr sie giiltigen Verkniipfungsregeln (Teil- 
barkeitsverhaltnisse, Multiplikation ) charakterisiert sind. Das 
Vorbildfiir eine derartige Behandlung der Idealtheorie findet 
sich bei Dedekind. 

Thus, Krull shows that it is possible to develop the ring-theoretical 
notions of prime (ideal), radical, isolated component etc. in his very 
general abstract setting of “ideal domains”. Though he does not point 
out explicitly the role of adjunctions in his theory, the tool of quotients 
( residuals ) is fundamental and indispensible for a great part of his 
constructions and conclusions. 

Krull axiomatically introduces an ideal domain as a set with a partial 
order (called divisibility relation) such that any two elements a, b have 
a least common multiple (1cm) a fl b and a greatest common divisor 
(gcd) a + b, and moreover, even arbitrary (not necessarily finite) sub- 
sets have a gcd. So far, this just defines a complete lattice in the sense 
of Birkhoff. Furthermore, Krull postulates an associative (but not 
necessarily commutative) extra multiplication that distributes from 
both sides over arbitrary gcd’s and has the “two-sided ideal property” , 
meaning that the least common multiple always divides the product 
of two elements. Of course, this nomenclature is still in the tradition 
of arithmetic and algebra. But the divisibility relation of concrete ide- 
als is dual to set inclusion. Hence, in modern terminology, one would 
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refer to the dual notions and speak of a complete (two-sided) residu- 
ated semigroup if one has a complete lattice with an extra associative 
multiplication that distributes over arbitrary joins (denoted by the 
symbol \f) and is dominated by the binary meet operation: 

a • \/ B = \/(a-B) , \f B ■ a = \/(B- a) , and a-b < a A b. 

A short modern name for structures satisfying the previous two dis- 
tributive laws is quantale, and quantales with the third property a-b < 
aAb are said to be two-sided (see Rosenthal [120] for a comprehen- 
sive discussion of quantales and their applications). From the his- 
torical point of view, it is remarkable that Krull was one of the first 
mathematicians dealing with arbitrary joins (or meets) in an algebraic 
context, but he resisted introducing a symbolic notation for such an 
infinite operation, writing the left distributive law in the following 
form: 

o • (&i + + ■ • ■ ) = a ■ b\ + a ■ &2 + • • • 

Apparently, the “finitary” standpoint in algebraic contexts was quite 
strong at that time and also in the subsequent decades (cf. the various 
editions of van der Waerden’s Algebra [142]). 

The crucial definition (with regard to residuation theory) is now that 
of left quotients b~ l a and right quotients ab~ l by 

b~ l a = \/{ c | be < a} and ab = \/{c | cb < a } . 

Krull notes at once that these suprema (ged’s in his terminology) 
are in fact maxima (least elements with respect to his dual order of 
divisibility), which could also be expressed by the typical adjunction 
equivalences 

ab < c a < cb~ l <=> b < a~ l c. 

Altough the use of the exponent symbol -1 is perhaps a bit dangerous, 
because actual inverse elements rarely will exist, the notation is quite 
convenient and rather stable, on account of the rules 

b~\c~ l a ) — (c6) _1 a, (ac _1 )6' 1 = a(6c) _1 , (6 _1 a)c _1 = b~ 1 (ac^ 1 ), 

so that one could even omit some parentheses and write b~ l ac~ l for 
the last term. Krull notes without proof some further elementary rules 
like 

b~ l (/\A) = f\{b~ l a | a G A}, 
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a{\/B)- l = Mab- 1 | beB}, 

referring to Dedekind’s Supplement XI [27] for the case of modules, 
and to his own paper [82] for the general commutative case. 

In the second chapter, Krull gives an elegant proof of a first “prime 
ideal theorem”. He defines prime ideals (or simply primes) p by the 
usual condition that ab < p implies a < pov b < p but does not require 
explicitly that a prime should be distinct from the top element. The 
primes p with p > a are referred to as the primes of a. (Recall that 
our > is denoted by < in Krull’s paper and is regarded as a divisibility 
relation; accordingly, he speaks of a “highest prime (ideal) of a if he 
means a minimal prime p > a in our sense). The theorem in question 
provides the first step towards the derivation of prime decompositions 
from suitable chain conditions. 

Theorem on Minimal Primes 

Suppose Q is an ideal domain (two-sided quantale) satisfying the fol- 
lowing quotient chain condition: there is no properly descending se- 
quence of elements aj such that aj + 1 is always a left or right quotient 
of aj. Then, for each a £ Q, there is a finite antichain of primes 
Pi , ...,p n > « such that (pi...p n ) r < a for suitably large exponents r. 
These primes are precisely the minimal primes of a. 

Proof As usual, an antichain is a set of pairwise incomparable ele- 
ments; thus, pj < pk is excluded for j ^ k. Krull’s proof is similar 
to the common argument leading to prime decompositions for natural 
numbers or elements of distributive lattices with chain conditions, but 
a bit more subtle. It suffices to show that if an element a fails to 
have the required property then so does a smaller quotient of a (this 
would then produce a properly descending chain of quotients). Un- 
der the contrapositive assumption on a, this element cannot be prime, 
so there exist elements b, c with be < a but b ^ a and c ^ a. The 
quotients oq = ac~ l and a 2 = oq _1 a satisfy aia 2 < a but a < a\ and 
a < o 2 . (Indeed, ac < a implies a < ac -1 = oq, while the assumption 
ai < a together with be < a would entail b < ac~ l = oq < a; and 
similarly, oqa < a implies a < aq _1 a = a 2 , while a 2 < a together with 
aiC < a would entail c < oi _1 a = a 2 < a.) If either aj ( j = 1, 2) would 
have the required property (pj^...pj ynj ) r: > < aj with suitable pairwise 
incomparable primes pj^, then the minimal elements in the union of 
these two antichains would form a finite antichain {pi,-..,p n } with 
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( pi...p n ) ri+r 2 < G 1 G 2 < a, in contrast to the assumption on a. Hence, 
either ai or a 2 fails to have the property in question. That a finite 
antichain {pi, ...,p n } of primes pj > a with {pi...p n ) r < a must contain 
all minimal primes p of a is clear, because (pi-.-p n ) r <P entails pj <p 
for some j and then pj = p by minimality of p. On the other hand, 
each pj must be a minimal prime of a, because a < p < pj implies, as 
before, pk < p < Pj for some k, so that p = Pj = Pk- n 

Under the quotient chain condition, any sequence built by elements of 
the form ab~ n = a{b n )~ l becomes stationary; its finally constant value 
is denoted by %{a) and called a right isolated component (ideal) of a. 
The following equations for the functions % are easily verified: 

ib+c = ^6 A Z c , if) C icb ^bAc 1 

and ib preserves finite meets. Analogously for left isolated components. 
Another useful result, stated as Theorem 2, is the following: 

b r < c implies ib~>i c - 

Indeed, induction gives ab~ n > ac~ n , and for n sufficiently large, 
ab~ n = ib{a), ac" n = i c (a). Applying this to the above theorem, 
Krull obtains the 

Theorem on Isolated Components 

Ifc is a product of all minimal primes ofb (in some order) thenib = i c - 

Adding the hypothesis that there is no ascending chain of primes, 
Krull now develops a purely abstract prime (ideal) theory, covering 
much of the material known from ring theory. 

From the perspective of adjunctions, it might be of interest that Krull’s 
famous Separation Lemma (also in the abstract version for quantales) 
is equivalent to the Boolean Prime Ideal Theorem, as shown in [9] - 
and that the main ingredient in the proof is again a certain residuated 
map. In the classical ring-theoretical setting, the Separation Lemma 
says that 

every ideal disjoint from a non-empty multiplicative subset M is 
contained in a prime ideal disjoint from M . 

For the general quantalic version (which covers the above and many 
other separation lemmas, see [51]), one needs the notion of open m- 
filters] these are subsets S' of a quantale Q such that 




112 



M. Erne 



(1) a,b G S and ab < c imply c G S (m- filter property), and 

(2) every directed set with join in S already intersects S (openness) . 

Separation Lemma for Quantales 

Any element a outside an open m-filter S in a two-sided quantale Q 
has a prime outside S. 

Of course, this is easily shown with the help of Zorn’s Lemma - but 
the point is that it can be derived from (and is therefore equivalent 
to) the 

Boolean Prime Ideal Theorem 

Every non-trivial Boolean algebra contains a prime ideal. 

The main tool in the proof is the map s : Q — >• Q defined by 

s(a) = \J{x GQ \ VyeQ(x\/yeS => aVyeS)}. 

One shows that s is a closure operation on Q whose image Q s = s(Q) 
is a locale (that is, a quantale whose multiplication is the binary meet). 
Hence the corestriction s : Q — > Q s is left adjoint to the inclusion map 
from Q s into Q. Moreover, the top element e of Q is compact in 
Q s , and its preimage .W({e}) is the original open m-filter S. Now, 
the Prime Element Theorem, known to be equivalent to the Boolean 
Prime Ideal Theorem [8], yields a prime in Q s distinct from e, which 
is then also a prime for a in Q outside S. 



4.4 Morgan Ward’s and R. P. Dilworth’s Residuated 
Lattices 

A few years after Krull’s discoveries, independently Morgan Ward and 
his student Robert P. Dilworth dealt with the same theme. Their work 
may be regarded as the first systematic investigation of residuated op- 
erations. They became acquainted with Krull’s work only after they 
had finished a great part of their investigations. The overlap is not 
broad, but of course, residuation plays a common central role in both 
theories. While Krull focussed on notions of primeness and decompo- 
sition theorems, Dilworth and Ward were more interested in general 
questions of existence and uniqueness of residuated multiplications 
and in the consequences thereof. In a retrospective, Dilworth writes: 
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Morgan Ward . . . was also impressed with the usefulness of 
residuation in the study of ideals in a ring. When I began 
my graduate work under his direction he suggested that I 
first make an axiomatic study of residuation over a lattice 
. . . On the basis of this work , he and I became convinced 
that much of the theory of ideals of a commutative ring 
could be carried out for lattices with a multiplication (resid- 
uated lattices). We were able to carry through much of the 
decomposition theory but were unable to get the deeper re- 
sults of ideal theory. The main obstacle was obtaining a 
suitable notion of ‘principal element’ in a multiplicative 
lattice. Our attempts were only partially successful. It was 
many years later (1962) that I discovered the right general- 
ization and began the deeper theory in my paper ‘Abstract 
Commutative Ideal Theory’. 

From a whole series of early papers on the subject (see [35, 36, 146, 
147, 148, 150, 151]) we pick here the one that provided the most signif- 
icant progress at its time (and covered the main results from its fore- 
runners), viz. the paper entitled Residuated Lattices [150] from 1938. 
The main purpose in that article is, similarly as in Krull’s first paper, 
an abstract foundation of the commutative ideal theory initiated by 
Dedekind and pushed forward considerably by the impressive work 
of Emmy Noether. Generalizing Dedekind’s lattice-ordered groups, 
Ward and Dilworth consider an arbitrary (not necessarily complete) 
lattice L endowed with an extra commutative monoid operation, called 
multiplication and operating distributively over the join. Furthermore, 
they assume that the greatest element i should coincide with the neu- 
tral element for the multiplication. Thus, such a multiplication must 
satisfy the rules 

a-i = a , 

a-(b-c ) = a-(c-b), 

a-(b Vc) = {a-b) V (a-c), 

and conversely, these identities (together with a < i) imply that one 
has a partially ordered commutative monoid with 

a A b > a-b > (a A 6)-(aV b). 

In particular, 
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aV b = i implies aAb = a-b and aV (b-c) = aV c. 

On the other hand, they consider so-called residuations, i.e. binary 
operations : satisfying 

a:b = i AA a > b, 

(i a:b):c= ( a:c):b , 

(a A b):c= ( a:c ) A ( b:c ). 

In the tradition of divisor and ideal theory, Ward and Dilworth used 
here the notations [a, b] ( “greatest common divisor” ) for the meet and 
(a, b) (“least common multiple”) for the join. Furthermore, they wrote 
a D b instead of > for the order relation defined by 

a > b b < a 44- a Ab = b <£4- a V b = a. 

Only later, they passed to the now common symbols A for the meet, V 
for the join, and > for the order relation. Also, they did not postulate 
the existence of a greatest element explicitly, but most of their con- 
clusions work only under that additional assumption. Instead, they 
included the following rules in the definition of a residuation: 

a > b implies a:c> b:c and c:b > c:a, 

c:(aV b) = (c : a) A (c : b) . 

But these rules are easy consequences of the others: first, one derives 
from the first two postulates for residuations the useful equivalence 

c:a > b c:b > a 

which say that ij ) c ) with ip c ( a ) = c : a is a Galois connection. 

This proves the equation c : (aV b) — (c : a) A (c : b) and, a fortiori , 
the implication a > b => c \ b > c : a. Similarly, the implication 
a > b a: c > b:c follows from the equation (aAb) :c = (a :c)A(b:c). 
Other consequences of the above postulates are the identities 

a:a = i, i:b = i, (a Ab) : b = a : b = a : (a V b ) , 

and the inequalities 

a:b > a, a:(a:b) > a V b, 

(ay b) : c > (a : c) V (b : c) , c : (aAb) > (c:a) V (c:b). 

Verifying a sequence of technical details, Ward and Dilworth prove the 
following remarkable 
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Correspondence Between Multiplications and Residuations 

(1) Every complete lattice L with a multiplication ■ such that 

a-\jB = \J{a-b | be B} for all B C L 

admits a residuation given by c:a > b a-b < c. 

The same holds if L is merely a lattice but the sets {x \ a-x < c} 
satisfy the ascending chain condition. 

(2) Every complete lattice L with a residuation : such that 

f\C :a = f\{c:a \ c G C} for all C C L 

admits a multiplication given by a - b < c c:a > b. 

The same holds if L is merely a lattice but the sets {y \ y:a> b} 
satisfy the descending chain condition. 

Moreover, the above two constructions are inverse to each other. 

These facts and something more can be shown very easily with the 
help of adjunctions. Call a commutative partially ordered monoid 
(M, •) residuated if each of the translations A a : x i — > a-x is residuated 
(in other words, if each of the sets {x \ a-x < c } with a, c G M 
has a greatest element). Similarly, we mean by a commuting residual 
groupoid a partially ordered set M with an operation : such that the 
maps g a : y i— >■ y : a are residual (i.e., the sets {y \ y : a > b} with 
a, b G M have least elements) and commute with each other. The 
top element i is said to be discriminating if it determines the order by 
a:b = i a > b. The operation : is then automatically a residuation 
in the sense of Dilworth and Ward. (However, the converse conclusion 
is problematic for infinite lattices; therefore, the chosen nomenclature 
of “ residuation ” might be a bit misleading.) The above correspondence 
may be now be rephrased as a 

Theorem on Residuated Monoids 

Sending each residuated monoid ( M , •) to the ordered groupoid ( M , :) 
with c:a = max{a:|a-a: < c}, one obtains a one-to-one correspon- 
dence between commutative residuated monoids having a neutral top 
element and commuting residual groupoids having a discriminating top 
element. The bisection in the opposite direction is obtained by sending 
(. M , :) to the partially ordered monoid (M, •) with a-b = min{y | y:a > 
b}. 
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In fact, a commutative residuated monoid may be regarded as a set 
M together with a family of commuting left adjoint maps X a (a G 
M) containing the identity map; on the other hand, a commuting 
residual groupoid may be regarded as a set M together with a family of 
commuting right adjoint maps g a ( a G M). And the passage between 
left and right adjoints, 

a-x = X a (x) < y O x < g a (y) = y:a 

gives the desired bijection, because 

A a (i) = a -i = a < b i < g a (b ) — a:b ■<=> i = a:b. 

Note that the “left translations” x i-» c:x invert the order and trans- 
form all existing joins into meets: 

a = \jB =>- c:a = f\{c:b \ b E B} . 

Indeed, 

x < c:a <=> a < c:x V6 G B (b < c:x) x < f\(c:B). 

The non-commutative case is more complicated (see for example Dil- 
worth [36], Blyth and Janowitz [16]). 

After having established the correspondence between “multiplications” 
and “residuations” , Ward and Dilworth give some necessary and some 
sufficient conditions for the existence of such pairs linked by adjunc- 
tion. They claim the existence of a unique divisor-free element (later 
called coatom ) would be a sufficient condition. That this is not true in 
general is easily seen by looking at the componentwise ordered com- 
plete lattice 

{(a:, 0) : 0 < £ < 1} U {(0, 1), (1, 1)} 

with least element 0 = (0,0), greatest element i = (1,1) and unique 
coatom c = (0, 1) : for a = (a, 0) and b = (/?, 0) with 0 < (3 < a < 1, 
one obtains 

(a-b) V (a- c) < (a A b) V (a A c) = a A b = b < a = a- i — a- (b V c) , 

no matter how one tries to define the multiplication. Apparently, 
Ward and Dilworth mean a sligthly stronger hypothesis, namely that 
the unique coatom dominates all elements distinct from the top. In 
that case, it is easy to define a residuated multiplication by a • i = 
a,i -b = b and a-b = 0 otherwise. This type of examples shows 




Adjunctions and Galois Connections 



117 



that no non-trivial lattice identity is necessary for the existence of a 
residuation - a remarkable discovery at a time where experts believed 
that a reasonable residuation theory might be possible in modular or 
similar lattices only (as suggested by the classical case of ideal and 
submodule lattices). On the other hand, Ward and Dilworth observed 
that certain weakenings of the distributive law 

(a A b) V c = (a V c) A (b V c) 

are necessary for the existence of a residuation. From the inequality 
ab = a-b < aAb and the distributive laws, valid in any residuated 
lattice, one derives the distributive inequality 

(a V c)(bV c) = aby acV cbV cc < aby c, 

which in turn entails the semidistributive law 

ay c = b\/ c = i =>- aby c = i => (a A £>) V c = i , 

It follows that each coatom c has to be prime and, a fortiori , A -prime, 
that is, 

a Ab < c => ab < c => a < c or b < c. 

Motivated by the background from ideal theory, Ward and Dilworth 
call two elements a, b of a residuated lattice L coprime if aV b = i. As 
remarked earlier, this implies ab = a Ab and, moreover, 

aby c= (a A b) V c = (aV c) A (b V c) = (aV c)(b V c) for all c G L, 

since (aV c) V (b V c) = (aV b) V c = * entails 

(aV c) A (b V c) = (aV c)(b V c) < ab V c < (aV c) A (b V c) . 

Now, it is not hard to prove: 

Local Distributive Laws for Residuated Lattices 

Let Abe a finite set of pairwise coprime elements in a residuated lattice 
L. Then 

/\A V c = V c) (= /\{a V c | a G A}) for all c G L. 

If A does not contain the top element i then the map 

cp: 2 a ~VA^B = {/\X \XC A}, X^y/\X 

is an isomorphism, and consequently B is a Boolean lattice. In partic- 
ular, if A is a finite set of coatoms then the elements above f\A form 
a Boolean lattice isomorphic to 2 A . 
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Proof. The identity f\A Vc = /\(dVc) is obtained from the proven 
case A = {a, b} by induction. While Ward and Dilworth establish 
distributivity and complementation of B by explicit calculations, we 
prefer an easier argument using the Galois connection (<p, if) with 

ip : B —»■ VA, b^-{a£A\b<a) 

(note that just by definition, b < <p(X) = f\X <=> X C ip(b)). As (f> 
is onto, it suffices to prove the inclusion tpocp(X) C X for X C A. But 
c £ A\X and c £ xpo ip(X), i.e. f\X < c, would imply c = f\X V c = 
f\(X V c) = i # A. If A consists of coatoms then each c> f\ A satisfies 

c = f\A V c = f\(A V c) = f\{a £ A \ c < a} £ B . □ 

Years later, Dilworth demonstrated that if each interval of a lattice 
has the above property and satisfies suitable chain conditions then 
each element has a unique irredundant join-decomposition into join- 
irreducible elements (actually, Dilworth proved the dual). In the six- 
ties, Crawley and Dilworth were able to weaken the finiteness condi- 
tions considerably (see [24, 38]), as presented in the fascinating book 
Algebraic Theory of Lattices [25]. 

That the local distributivity at the top is not sufficient for the exis- 
tence of a residuation can be checked by a look at the free modular 
lattice with three generators (see the diagram in Section 2.5). Using 
Dedekind’s notation, Ward and Dilworth argue as follows. If : was a 
residuation then 

ao'.d 1 = ( a ' A a "') : d' = a' : d! A a"' :d' = a' :d' A i = a' :d' > a' 

and analogously bo : d’ > b' , cq : d' > c' . Now, from h := oq '■ d' = 
bo ■ d' = c 0 : d' it would follow that h > o' V b 1 V c' = d"" = %, 
contradicting a 0 ^ d' . But, since it is not clear how to confirm the 
claimed equality for h, I propose the following alternative. If there 
would exist a (residuated) multiplication • then 

a'-d' = a'- (bo V c 0 ) = (a 1 -bo) V (a'-co) 

< (a' A bo) V ( a ' A Co) < d\ ; 

similarly b'-d' < d\ and c'-d 1 < di, whence 

d' = ( a ' Vb'V c') ■ d' = (a'-d') V (b'-d') V (c'-d') < d 1 , 
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a contradiction. 

It should be mentioned that the local distributive law for residuated 
lattices cannot be extended to infinite subsets A, as a straightforward 
inspection of the cofinite topology on an infinite set shows. Although 
this is a complete distributive lattice with ascending chain condition in 
which every element is a meet of coatoms, it is certainly not Boolean, 
and the infinite distributive law aV/\l = /\(a V X) fails. 

While even a finite distributive lattice may possess several residu- 
ations, Ward and Dilworth showed that a complemented residuated 
lattice must already be Boolean, and the multiplication must be the 
meet operation. Since their computations are rather complicated, we 
add here a short proof for a slightly stronger result, providing one of 
dozens of possible axiomatizations for Boolean algebras (cf. Hunting- 
ton [76]). 

Characterization Theorem for Boolean Algebras 

If a join-semilattice S with least element 0 and greatest element i car- 
ries a commutative binary operation - and a unary operation ' such 
that 

a-i = a, a-a 1 = 0, aVa' = i, and a-(bW c) = (a-b) V (a-c) 
then a-b is the binary meet , hence ( S , V, •, 0, i ,' ) is a Boolean algebra. 
Indeed, a V a' V b = i implies 

a A b < a A (a' V b) = a- (a 1 V b) = a-a' V a-b = a-b. 

The next chapter of [150] is devoted to the study of so-called Noether 
lattices. These are lattices satisfying the ascending chain condition 
and admitting a residuation so that every meet-irreducible element p 
is primary, i.e. p > a-b implies p > a or p > b s for some s. Of 
course, this class includes all ideal lattices of Noetherian rings, whence 
the name. Ward and Dilworth impressively demonstrate that many 
ring-theoretical results, in particular the fundamental decomposition 
theorems, remain valid in the general realm of Noether lattices. A 
crucial role is played by elements they call principal, such elements c 
are characterized by the property that for each a < c, there is a b with 
a — b-c. In the residuated case, this may be expressed by the equation 
a = ( c:a)-c . It will be convenient to call an element p power-principal 
if all powers p k are principal. A central result in the lattice-theoretical 
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abstraction of ideal theory is now the 

Theorem on Modular Noether Lattices 

Let L be a modular residuated lattice satisfying the ascending chain 
condition. Then the following statements are equivalent: 

(a) L is a Noether lattice. 

(b) Each element has an (irredundant) finite primary decomposition. 

(c) For all a,b € L there is a number k such that a A b k < ab. 
Sufficient for these properties is that 

(d) Each element is a join of power-principal elements. 

Proof. (a)=>(b). By the ascending chain condition, each element is the 
meet of a finite number of meet-irreducibles, and these are primary. 

(b) =^(c). Represent ab as a meet of primary elements p\, ... ,p n ■ Then 
ab < pj implies a A b k j < Pj for suitable exponents kj, and for their 
sum k , it follows that a A b k < pi A ... A p n = ab. 

(c) =^(a). Let m be a meet-irreducible element and consider a,b E L 
with ab < m but a m. Then we have d = aV m > m, and we 
find an s such that d A b s < db = abV mb < m < d. By modularity, 
m = (m V b s ) A d, and as m is irreducible, m = m V 6 s > b s , proving 
primarity of m. 

(d) =^(a): For any power-principal element p < b, the ascending chain 
condition yields a k such that dp:p k = dp:p k+1 . Now, there is a q 
with qp k = dAp k , whence 

qpk+i _ (d/\p k ffi < dp => q < dp:p k+1 — dp:p k => dAp k = qp k 
< dp. 

Replacing b with p in (c)=^(a), one concludes that m > p k . Since 
b is a finite join of such power-principal elements pj, the sum s of 
the corresponding exponents kj with m > pj kj then has the desired 
property m > b s . □ 

Clearly, every ring ideal is a join of principal ideals, and these are 
closed under the formation of powers and even of products (if the ring 
is commutative). Thus, the above theorem actually applies to classical 
ring theory. Modifying the notion of principal elements, Dilworth later 
found still more powerful results (see his own comment and [37]). 
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In the last part, Ward and Dilworth deal with conditions under which 
a residuated lattice is distributive. They show that the postulates 

(a : b) V (b : a) — i 

a : (b A c) = (a : b) V (a : c) 

(6 V c) : a = (b : a) V (c : a) 

are mutually equivalent and entail distributivity. Another sufficient 
condition for distributivity is provided by the following 

Distributive Law for Principal Elements 

In a residuated lattice, the unary meet operations inaAi preserve 
all principal joins: if c = \J B is a principal element then a A c = 
V(oAB). 

Indeed, if d > a A b for all b 6 B then 

d:c = A {d:b \ b e B} > /\{(aAb):b \ b e B} = f \{a:b \ b e B} 
= a :c, 

hence d> (a: c)c = aAc. 

An immediate consequence is the following 

Corollary A complete lattice admits a residuated multiplication such 
that all elements are principal iff it is a locale (that is, a complete 
lattice satisfying the distributive law a A \/ B = \J{a A B) for all 
subsets B ). 

Of course, the prominent examples of residuated lattices in which all 
elements are principal are the ideal lattices of principal ideal domains. 
But also in every lattice-ordered group, all elements are trivially prin- 
cipal. This yields an alternative proof of Dedekind’s theorem on the 
distributivity of lattice-ordered groups. But recall that such a group 
never can be a complete lattice, lacking a greatest and a least element. 



4.5 T.S. Blyth and M.F. Janowitz: Residuation 
Theory 

The first comprehensive monograph about residuation theory was writ- 
ten by T. Blyth and M.F. Janowitz in the early seventies [16], after a 
period of 
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approximately 20 years where the general theory of residu- 
ated mappings seems to have lain dormant 

as the authors write in the introduction. According to the main 
themes, the material is divided into three chapters: 

1. Foundations 

2. Coordinatizing Baer Semigroups 

3. Residuated Algebraic Structures 

Besides some basic and some more advanced facts on orders and lat- 
tices, the first chapter contains, of course, a section about adjoint 
pairs of mappings. The emphasis is here on descriptions of various 
properties for mappings between ordered sets by means of principal 
ideals. For example, so-called quasi-residuated mappings A are char- 
acterized by the condition that inverse images of principal ideals be 
order ideals (i.e. nonempty down-sets), or equivalently, by isotonicity 
together with the existence of a map g in the reverse direction so that 
Xog < id. The characterizing equivalence for closure operations, here 
termed closure mappings, is written as an equation: 

i*~[v) = 7 ^[t (y)), 

where ^[x) denotes the preimage of the principal dual ideal gener- 
ated by x. Clearly, kernel maps are characterized by the dual equa- 
tion. After these “forerunners” , residuated mappings A are introduced 
as mappings such that inverse images of principal ideals are again 
such, and are characterized by the existence of a map g in the reverse 
direction with Xog < id and goX > id. Thus, the map g is the uniquely 
determined right adjoint or residual of A : P — » Q and denoted by A + . 
The map 

V{X) = A+ : V{Q) -► V(P) 

between the dually ordered sets V(Q) = Q d and V(P) = P d is then 
again residuated, with 

VoV(X) = X and V(Xoip) = V(<p) o V(X). 

As a consequence of the definitions, the closure mappings are pre- 
cisely the composites of a residuated map with its right adjoint, and 
the kernel mappings are precisely the composites in the opposite or- 
der. Of basic importance for structural investigations is the fact that 
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the composition of residuated maps yields again residuated maps, so 
that the ordered sets together with the residuated maps form a self- 
dual category under the self-inverse endofunctor V. A further useful 
property of the duality functor V is that it preserves the pointwise 
order between residuated maps. Thus, 

X < p O D( A) < P(g) , 

or explicitly (observing that V{P) and V(Q) carry the dual order), 

Vp G P (A (p) < fi(p)) O VqeQ (g. + (q) < A +(?)) . 

Immediate from these facts is the following table, providing the trans- 
lation of properties for residuated maps to their adjoints, and vice 
versa. 



residuated map A 
(left adjoint) 


residual map g = X + 
(right adjoint) 


identity or 
inequality 


injective 


surjective 


goX = id 


surjective 


injective 


Xog = id 


isomorphism 


isomorphism 


goX = id, Xog = id 


extensive 


contractive 


X > id, g < id 


contractive 


extensive 


X < id, g > id 


idempotent 


idempotent 


II 

Cd 

O 

II 

0 


closure map 


kernel map 


0 

II 

Qj 

0 

II 


kernel map 


closure map 


X — Xog, g = goX 



By the above remarks, the residuated maps on a fixed ordered set P 
form a (partially) ordered semigroup Res(P) under composition and 
pointwise order, and the functor V induces an isomorphism between 
Res(P) and Res (P d ) . 

As indicated in the title, the second chapter deals with a special class 
of partially ordered semigroups, the so-called Baer semigroups. They 
are the basic tool for the coordinatization of arbitrary bounded lattices 
L, i.e., the isomorphic representation of L as the lattice of certain left 
or right principal ideals of suitable semigroups or rings. This theory is 
closely related to but easier than John von Neumann’s famous theory 
of continuous geometries [145] (which implicitly contains some further 
interesting examples of Galois connections), but of course also with 
Reinhold Baer’s work on ring theory and projective geometry (see e.g. 
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A left (right) ideal of a semigroup S' is a subset / with SI C I (respec- 
tively, IS C I), and a (two-sided) ideal is both a left and a right ideal. 
Since the left (right, two-sided) ideals of a semigroup are closed under 
arbitrary unions and intersections, they form an Alexandroff topol- 
ogy. Subsets of the form xS = {xy : y G 5} and Sy = {xy : x G S} 
are referred to as principal left or right ideals, respectively. Denote 
by CX(S) and IZX(S) the closure systems of all left and right ideals, 
respectively, and by VCI(S) and VHI(S) the systems of all principal 
left and right ideals, respectively. 

If K is a left ideal and M is an arbitrary subset of S then the left 
residual, quotient or K -annihilator of M is the left ideal 

Lk{M) — {x G S : xM C K} (where xM = {xm : m G M}) , 

and if K is a right ideal then the right residual, quotient or K -anni- 
hilator of M is the right ideal 

Rk{M ) = {x G S : Mx C K} (where Mx = {mx : m G M}) . 

Thus, if K is a fixed ideal, the equivalence 

I C L k (J) ^ IJ C K J C R k (I) 

for / G jCX(S) and J G IZI(S) gives rise to an obvious Galois connec- 
tion between CI(S) and IZI(S), but in general not between VCT{S) 
and V1ZX(S), nor to an isomorphism between £I(S) and IZI(S) . The 
relation inducing the polarity in question is given by xRxy xy G 
K. 

Now, Blyth and Janowitz introduce Baer semigroups as pairs ( S , K) 
consisting of a semigroup S and a nonempty ideal K such that the left 
and right K-annihilator ideals L K (x) and Rk{x) of singletons {x} are 
principal and generated by idempotent elements. It is then easy to see 
that S must have a two-sided identity element 1 and that K is of the 
form kS = Sk for a central idempotent k (i.e. kk = k and kx = xk 
for all x G S). Thus, a Baer semigroup may also be defined as a pair 
(. S , k) with a central idempotent k such that for each x G S, there 
are idempotent elements k x, x k with Lk(x) = S k x and Rk{x ) = x k S 
(where L stands for L k and R for Rk)- It is then also a custom to 
say that S' is a Baer semigroup with focus k. Often, k will be a zero 
element 0, satisfying xO = Ox = 0. In that case, it will suffice to speak 
of the Baer semigroup S. A Baer ring is defined by the postulate 
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that its multiplicative semigroup be a Baer semigroup with focus 0. 
Of course, any monoid with 0 and 1 having no proper zero divisors 
is a Baer semigroup (with focus 0). Here, all left or right annihilator 
ideals consist of the zero element only. Other important examples of 
Baer semigroups are: 

(1) Any multiplicative subsemigroup S' of a Baer ring containing all 
idempotents; in particular, the multiplicative semigroup of any 
regular ring in the sense of von Neumann [145]. In such rings, for 
each x there is a y with xyx = x ; equivalently, each principal left 
or right ideal is an annihilator; here, one may take k x = 1 — xy 
and x k = 1 — yx). 

Examples: all rings with identity having no proper zero divisors 
(in particular all division rings), all Boolean rings, all matrix 
rings over division rings, and all rings of bounded operators of 
Hilbert spaces. 

(2) The semigroup 1Z = V(X x X) of all relations on a set X, 
with the product RS = {(x,z) : By (xRySz)} and the empty 
relation as focus. Here, for S £ 71, an easy computation gives 
Rt(S) = S 0 R with S 0 = {(x,x) : x £ X\SX} = S 0 S 0 , and 
analogously for Lq,(S). 

(3) All pseudocomplemented semilattices. They are just those Baer 
semigroups with focus 0 in which all elements are central idem- 
potents. Similary, any element k of a Brouwerian semilattice 
S (see the next section) gives rise to a Baer semigroup ( S,k ). 
In that situation, L k (x) = R k (x) = x k S, where x k is the rela- 
tive pseudocomplement of x with respect to k, i.e. the greatest 
element y with x Ay < k. 

(4) The semigroup S = Res(L) of all residuated self-maps of a boun- 
ded lattice L, with the constant zero map as focus. Here, the 
construction of idempotent generators for annihilator ideals is a 
bit more involved. It is easy to check that for any x £ L, the 
map 9 X : L — >■ L, defined by 9 x (y) = y for y < x and 0 x (y) = x 
otherwise, is an idempotent element of S. Now, given arbitrary 
9,p e S, one observes that p £ Rq{9), i.e. 9oip = 0, is equivalent 
to y>(l) < 0 + (O) =: x. But 1) < x entails <p = 9 x op (because 
piy) < x for all y £ L); and conversely, if ip = 9 X op then <p{x) = 
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6 x (<p( 1)) < 9 X ( 1) = x. From this it follows that Rq( 0) = 9 x oS. 
A similar construction gives L 0 (6) = S o tf) y with y = 0(1) and 
if y (x) = 0 if x < y, ipy(x) — x V y otherwise. 

Theorem on Ideal Lattices of Baer Semigroups 

For any Baer semigroup (S', k), 

C k = L k {S ) = { L k (x ) :iG5} and 7 Z k = R k (S) = { R k (x ) :ieS} 

are dually isomorphic bounded lattices. The Galois connection (L k , R k ) 
induces mutually inverse dual isomorphisms between these lattices. 

Proof. By definition of Baer semigroups, C k and 7 Z k are the ranges of 
the Galois connection (L k , R k ) between CV1(S) and 7 ZVT(S). Clearly, 
kS = Sk — L k { 1) = R k { 1) is the least and S = L k (k) = R k (k) is the 
greatest element of C k and of 7 Z k . The only detail that remains to 
be shown is that the intersection of two members eS, fS G 7Z k (with 
idempotent e, /) is again in 7Z k . This and the corresponding state- 
ment for C k together with the dual isomorphism between those posets 
then give the desired lattice properties. First, note that eS, fS G 7Z k 
imply R k ( k e) = R k (L k (eS )) = eS and R k ( k f) = fS. Next, put 
h = k ef and g = fh k . Idempotency of / yields hg = hh k G kS and 
g = h k g = fg = gg. Now, if y G gS then y = gy = fh k y G fS, hence 
k ey = hh k y G kS, and so y G R k ( k e) H fS = eS C I fS. Conversely, 
y G R k ( k e ) fl fS implies k ey G kS and y = fy, hence hy = k ey G kS 
and so y = fy G h k S, i.e. y G fh k S = gS. Thus, we have 
gS = eSnfS = R k ( k e)r\R k ( k f) and therefore gS = R k (L k (gS )) G 7Z k . 

□ 

Generalizing various notions of coordinatizing objects in geometry and 
algebra, Blyth and Janowitz say that a Baer semigroup (S, k ) coordi- 
nates a poset E in case E is isomorphic to the right A;-annihilator 
poset R k (S), hence dually isomorphic to L k (S). Then they prove the 
powerful 

Coordinatization Theorem for Bounded Lattices 

Any bounded lattice L is coordinatized by the Baer semigroup S = 
Res(L) with focus 0 ; i.e. L is isomorphic to Rq(S ) and dually isomor- 
phic to Lq(S). Hence, the following three statements on a poset E are 
equivalent: 

(a) E is a bounded lattice. 
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(b) Res (E) is a Baer semigroup with focus 0. 

(c) E can be coordinatized by a Baer semigroup. 

Proof. We have already seen that S = Res(L) is a Baer semigroup, 
and that any member of Rq{S) is of the form Ro(9) = <poS where 
c p is an idempotent element of S. For any such ip, it easily follows 
that tp( 1) = 0 + (O). Therefore, one obtains a well-defined map A : 
Ro(S) — > L by setting A(poS) = p{l) = # + (0). Indeed, each of the 
implications 

ipoS C ipo S => ip = poip =>- ip(l) = p(ip(l)) 

=$■ ip( 1) < p{l) = # + (0) =>- ip e Ro(9) 

may be inverted because the last statement implies the first. This 
shows that A is an order embedding. In order to prove surjectivity, 
pick any x 6 L and put 9{y) = 0 if y < x and 6(y) = 1 otherwise. 
Then 9 G S, and one finds an idempotent <p with Rq(9) = (poS. Thus, 
A((po5) = ^(l) = 0+(O) = a:. □ 

The major part of the second chapter is devoted to a series of more 
difficult theorems characterizing modular complemented lattices, or- 
thomodular lattices and Boolean lattices by means of various specific 
classes of coordinatizing Baer semigroups. This provides a nice frame- 
work for John von Neumann’s coordinatization of continuous geome- 
tries [145]. These are join- and meet-continuous, modular, comple- 
mented and irreducible complete lattices and may be represented as 
certain ideal lattices of regular rings. Von Neumann’s proof of that 
deep result is one of the longest and most difficult proofs in lattice 
theory and its relationship to classical algebra. 

The third chapter of the book by Blyth and Janowitz deals with diverse 
kinds of residuated algebraic structures having a binary operation, i.e. 
residuated groupoids. Among them are Querre semigroups, Dubreil- 
Jacotin semigroups and various types of semigroups admitting gen- 
eralizations of the Glivenko-Frink theorem about the Booleanization 
of pseudocomplemented semilattices. Useful tools in that theory are 
diverse types of congruences like Molinaro congruences, zigzag equiva- 
lences etc. which ensure that the image of a given residuated groupoid 
or semigroup under a (residuated) homomorphism becomes a partially 
ordered (quasi) group, a Boolean lattice or a similar structure having 
a specific kind of residuation. 
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4.6 Heyting Algebras, Locales and Pointfree 
Topology 

Long overdue now is a remark concerning residuated meet operations 
and their relationship to non-classical logics, modelled by Brouwerian 
(semi)lattices or Heyting algebras. The reader familiar with intuition- 
istic logic will be aware of the strong similarity between the system of 
axioms for residuations and systems of axioms for implicational logic. 
For example, the rule 

a:b = i iff a > b 

resembles the “deduction theorem” 

a— »■ 6 = T iff ahb 

providing the basic connection between formal implication, deduction 
of formulas, and metalinguistic conclusion. However, in order to make 
the translation more consistent, it is better to translate a : b into 
b —*■ a and b < a into b b a if the multiplication is replaced with 
A and interpreted as conjunction (“and”). Then the other rules for 
residuations obtain the following form: 

c — y (b — y a) — b — y (c — y a ), 

c — y af\ b = (c — y a) A (c — y b), 

aV b — >■ c = (a — » c) A (b — >• c), 

all well-known from intuitionistic logic and other formalized languages 
of logic. The identity 

a : ( b-c ) = ( a:b):c 

turns into the rules of importation and exportation: 
c A b — y a = c — y (b — y a ) , 
while the equivalence 
cf\b\~ a iff c h y a 

may be viewed as a “semiformal” variant of these rules. Of course, 

these coincidences are not casual: indeed, the algebraic counterpart 
of intuitionistic logic is the notion of Heyting algebra [71]. This is a 
bounded lattice with residuated meet operation, or, what amounts to 
the same thing, with a residuated multiplication making all elements 
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idempotent (then the inequality a A b = (a A b) 2 < ab < aAb shows 
that ab must coincide with the meet aAb). 

By the basic fact that in complete lattices, the left adjoint maps are 
precisely the join- preserving ones, it is clear that the complete Heyting 
algebras are just the frames or locales , characterized by the distributive 
law 

a A \/ B = \/{a Ab : b € B}. 

There exists an immense literature on that topic, of which we only 
mention Johnstone’s excellent book Stone Spaces [78]. Perhaps some- 
what unexpectedly, in the last few decades logic and topology amalga- 
mated to a quite powerful synthesis, profiting from both theories (and, 
of course, from lattice theory and the theory of adjoint maps). The 
theories that arose from that fruitful interplay are pointfree topology 
and the more recent formal topology. 

Let us conclude our journey through the world of adjunctions and Ga- 
lois connections by the remark that meanwhile there exist extremely 
powerful categorical extensions of the modest order-theoretical notion 
of adjunctions, like categorical Galois connections and adjoint functors 
(see, for example, the early paper by Lawvere [87] on adjunctions, the 
nice article by Herrlich and Husek [70] on four types of categorical Ga- 
lois connections of increasing generality, and the inexhaustible source 
Abstract and Concrete Categories by Adamek, Herrlich and Strecker 
[3]). Another very modern theory in that context is the theory of 
Chu spaces [115], which combines categorical methods with the idea 
of multivalued contexts, known from formal concept analysis. For 
quite recent material on the role of (classical and categorical) Galois 
connections in pointfree topology and its links to classical topology, 
we refer to the paper General Stone Duality [52] and to the article The 
polarity between approximation and distribution in this volume. 



The slogan is: Adjoint functors arise everywhere. 



S. Mac Lane in 
Categories for the Working Mathematician 
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Categorical Galois Theory: Revision 
and Some Recent Developments 

G. Janelidze 



Abstract 

This expository paper presents a short review of categorical Galois theory, 
with special attention to the connection with A. R. Magid’s Galois the- 
ory of commutative rings and most recent developments in the theory of 
generalized central extensions. In the last section some open questions are 
proposed. 
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1 Introduction 

The so-called fundamental theorem of classical Galois theory (GFT 
for short) describes the lattice of subextensions of a finite Galois field 
extension E/K , and - according to its long formulation presented in 
many modern textbooks in algebra - also says a few words about 
the morphisms and especially automorphisms of those subextensions. 
However all this information is contained in one sentence, which is 
a corollary of a very special case of Grothendieck’s formulation of 
GFT: it says that the opposite category of subextensions of E/K is 
equivalent to the category of transitive Gal(E/K ) - sets. Categorical 
language also brings many other improvements and simplifications, 
and in particular allows algebraic geometers to explain that the Galois 
theory of fields and the theory of covering spaces are not just analogous 
but identical; in particular the fundamental group (of a scheme) is 
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nothing but the Galois group of the universal covering, i.e. the largest 
Galois group. 

Grothendieck’s GFT has been extended to an elementary-topos-theo- 
retic context in various ways by M. Barr, Diaconescu, J. Kennison, G. 
Wright, A. Joyal and M. Tierney, I. Noerdijk, M. Bunge and others, 
and there are many other non-topos-theoretical extensions, such as the 
one proposed by Y. Diers [19]; A. R. Magid’s theorem [55, Theorem 
IV. 31] seems to be especially important - since it is the final step in 
the extension of (separable) Galois theory from fields to commutative 
rings. Magid’s theory is indeed beyond the topos-theoretical level since 
it involves profinite groupoid actions on profinite topological spaces, 
and those do not form a topos. 

A purely categorical approach to Magid’s theory was first briefly de- 
scribed in [27] and then in [28], where it was already transformed into 
categorical Galois theory , with a new example of central extensions of 
groups - which is very far from Grothendieck’s Galois theory. These 
results were reported to Saunders Mac Lane, and presented in his talk 
[54] - and the next paper [30] was written according to his suggestion, 
with an additional page written by him for the introduction. Various 
further developments are briefly described in the book [4], whose main 
purpose is to show a precise chain of generalizations from the classical 
level of finite field extensions to the purely categorical level. 

This expository paper presents a short review of categorical Galois 
theory (with special attention to the connection with A. R. Magid’s 
Galois theory of commutative rings and most recent developments in 
the theory of generalized central extensions), where however I was try- 
ing to exclude the material that would copy the corresponding parts 
of [4], for example the categorical approach to covering spaces de- 
scribed in Chapter 6 of [4]; some “ minimal” definitions from [4] are 
replaced by more refined ones, especially useful for formulations of 
open questions and for a few observations that did not exist when the 
manuscript of [4] was submitted for publication. 

The paper is organized as follows: Section 1 shows how the topological 
notion of connectedness becomes categorical, and transforms it into 
what is called Galois structure , which is an abstract adjunction be- 
tween categories with specified classes of morphisms called fibrations. 
Several examples of Galois structures are given. The term “fibration” 
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is used for the first time; in fact it is suggested by the example studied 
in [8] (see Example 2.12). 

Since the role of Galois group(oid)s in general is played by internal pre- 
categories, the internal precategories and their actions are described 
in Section 3; a sufficient condition on a precategory action to become 
a category/groupoid action is also given. 

The categorical GFT in Section 4 is formulated essentially as in [32], 
and then some related notions are defined and some examples are 
described in Section 5. 

The links and applications to Galois theory of commutative rings are 
briefly described in Section 6, and the theory of generalized central 
extensions is considered in Section 7. 

Several open questions are listed in Section 8. 

1 would like to thank Klaus Denecke and the other organizers of the In- 
ternational Conference on Galois connections in Potsdam for inviting 
me; this paper is an extended version of the talk I gave there. 

2 Connectedness and Galois structures 

Usually connectedness is considered as a topological notion, but as 
the following simple and well known theorem shows, there are several 
ways to define it in a purely categorical language: 

Theorem 2.1. The following conditions on a topological space A are 
equivalent: 

(a) A is connected; 

(b) A is not the initial object in the category Top of topological spaces, 
and if A « B+C in Top, then either B or C is canonically isomorphic 
to A; 

(c) A is not the initial object in Top, and if A « B + C, then either 
B or C is the initial object in Top; 

(d) any morphism from A to a coproduct in Top factors through one 
of the coproduct injections; 

(e) the functor hom(A,-) : Top — > Sets preserves coproducts. 
Moreover, replacing Top with a general category C with coproducts, 
we still have the implications (e)=>(d), (d)=>(b), (c)=>(b), and if the 
coproducts in C are universal and disjoint (in A. Grothendieck’s ter- 
minology - that is, C is infinitary extensive in modern terminology), 
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then the converse implications also hold. In particular the conditions 
(b) - (e) are equivalent whenever C is locally connected in the sense 
of 2.2(c) below. 

Definition 2.2. Let C be a category with coproducts and finite limits. 
Then 

(a) an object A in C is said to be connected if the functor hom(A, — ) : 
C — > Sets preserves coproducts; 

(b) the category C is said to be connected if its terminal object is 
connected; 

(c) the category C is said to be locally connected if every object in C 
is a coproduct of connected objects. 

For the readers familiar with topos theory, we point out that C is not 
required to be a topos; however the first example in the following list 
in fact recalls the topos-theoretic motivation for 2.2(c). 

Example 2.3. (a) Let A be a topological space. The categories 
(Top l A) of bundles over A and Shv(A) of sheaves (of sets) over 
A are connected if and only if so is A; the category Shv(A) is locally 
connected if and only if A is a locally connected topological space. 

(b) If a category C is locally connected, then so is (C j, A), for any 
object A in C; the category (C j. A) is then connected if and only if 
so is A. 

(c) The category Cat of all categories is connected and locally con- 
nected; an object A in Cat is connected if and only if for every two ob- 
jects a and b in A, there exists a zigzag of morphisms a —X— . . . b 
between a and b. The category (Cat 4- A) is always locally connected, 
and is connected if and only if so is A (by (b)). This easily implies 
the same properties for the functor category Sets' 4 . In particular the 
following structures form categories that are connected and locally 
connected: M- sets (where M is a monoid), presheaves (of sets) over a 
fixed topological space, graphs, simplicial or cubical sets, etc. 

A locally connected category C can always be identified, up to a cate- 
gory equivalence, with the category Fam(Conn(C)) of families of its 
connected objects. Under this identification 

A rZj E Ai in C corresponds to A = (Aj)j e /inFam(Conn)(C); (2.1) 
iei 
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we will write 1(A) instead of I in order to consider it as a functor 

/ : C -4 Sets, (2.2) 

which in fact comes from the 2-functor Fam : Cat -4 Cat as 

/ = Fam(! Conn ( C )) : Fam(Conn(C)) — > Fam(l) = Sets, (2.3) 

where !conn(C) : Conn(C) -4 1 is the unique functor from Conn(C) 
to the terminal object in Cat. 

If in addition C is connected, then the functor (2) has a right adjoint 

H : Sets -4 C, (2.4) 

defined by H(X) = X • 1, where X ■ 1 is the coproduct of “ X copies” 
of a fixed terminal object in C. And again, the adjunction bfCq Sets 
is nothing but the image of the adjunctionConn(C)<jl under the 2- 
functor Fam. Moreover, the functor H itself has a right adjoint, 
namely hom(l, — ). 

Let us now assume that C is equipped with a fixed adjoint pair of func- 
tors (2.2) and (2.4), which were determined so far only up to an isomor- 
phism of course. This suggests to say that each object A in C has its 
/(A)-indexed family (Ai) ieI (A) of connected components, in which each 
Ai is defined as the pullback 



Ai 

i 

A 



VA 



l 

HI (A) 



(2.5) 

where tja is the unit of the adjunction, and the right hand vertical 
arrow is the image of the inclusion map {z} -4 1(A) under the functor 
H. 

Our next step is to make the notion of connectedness much more ab- 
stract by replacing Sets with an arbitrary category X. The structure 
which we obtain provides an appropriate level of generality for categor- 
ical Galois theory and therefore it is called Galois structure, although 
the definition we are giving here is slightly different from the original 
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one given in [32]. Its absolute version (see Definition 2.4(e)) is closely 
related to various other structures that occur in general connected- 
ness/ radical/ torsion theories (see [57] and references there). 

Definition 2.4. A Galois structure T on a category C with finite 
limits consists of an adjunction 

(I,H,r],e): C^X (2.6) 

and two classes F and $ of morphisms in C and X respectively, whose 
elements are called fibrations; the following conditions on fibrations 
are required: 

(a) all pullbacks along fibrations exist, and the classes of fibrations are 
pullback stable; 

(b) the classes of fibrations are closed under composition and contain 
all isomorphisms; 

(c) the functors I and H preserve fibrations. 

Such a Galois structure is said to be finitely complete, absolute, ad- 
missible, or closed, if it satisfies the following conditions respectively: 

(d) the categories C and X have all finite limits (usually there are all 
small limits, but we will never use them); 

(e) all morphisms in C and in X are fibrations; 

(f) for every object C in C and every fibration ip : X — >• 1(C) in 
X, the composite of the canonical morphisms I(C x.hi{c) H(X)) — >■ 
IH( X) — >■ X is an isomorphism; 

(g) for every object A in C, the morphism t)a '■ A — > HI (A) is a fibra- 
tion, and for every object X in X, the morphism : IH(X) — > X is 
an isomorphism. 

Let us consider/ recall some examples: 

Example 2.5. The following data determine a Galois structure sat- 
isfying all additional conditions (i.e. it is finitely complete, absolute, 
admissible, closed) of Definition 2.4: C any connected and locally 
connected category, X = Sets, / and H the functors (2.2) and (2.4) 
respectively, and F and $ the classes of all morphisms in C and X 
respectively. 

Example 2.6. The category C of locally connected topological spaces 
has an admissible Galois structure defined as in Example 2.5 but with 
F being the class of local homeomorphisms. 
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Example 2.7. Any adjunction (/, H, 77 , e) : C — v X becomes a Galois 
structure if we take F and $ to be the classes of isomorphisms in C 
and X respectively. It is always admissible, and closed if and only if 
it is a category equivalence. 

Example 2.8. Any pair (C ,F) consisting of a category C and a 
class F of morphisms in C satisfying the conditions 2.4(a) and 2.4(b), 
determines an identity Galois structure; it consists of the identity ad- 
junction (1, 1, 1 , 1) : C — > C with F = $, and it is always admissible 
and closed. 

Example 2.9. There is an obvious finite version of example 2.5. It 
involves finitary local connectedness instead of local connectedness and 
Finite Sets instead of Sets. That is, the category C is required 
to have finite coproducts, and every object in C must be a finite 
coproduct of connected objects. In particular C could be the opposite 
category of finite-dimensional commutative (unitary) algebras over a 
field. 

Example 2.10. There is also a profinite version of Example 2.5, with 
X = Profinite Spaces = Stone Spaces. It is in fact described in 
[10]. Let us restrict ourselves here to the following two special cases: 

(a) C is the opposite category of commutative (unitary) rings. In this 
case the functors / and H have the following description: 1(A) is the 
Boolean spectrum (=Pierce spectrum) of the ring A, i.e. it is defined 
either as the Stone space (=the space of maximal ideals) of the Boolean 
algebra of idempotents in A, or as the space of connected components 
of the Zariski spectrum of A\H(X) is the ring of continuous maps 
from the space X to the ring of integers equipped with the discrete 
topology. 

(b) C is the category of compact Hausdorff spaces, I carries spaces 
to the spaces of their connected components, and H is the inclusion 
functor. 

In both cases all the additional conditions of Definition 2.4 hold. In 
the case (a) the admissibility was used in [27] (see also [28]) in order 
to show that A.R. Magid’s Galois theory [55] extends to general cate- 
gories. The case (b) was used in [11] to give a categorical description 
of the monotone-light factorization of continuous maps of compact 
spaces. 
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Example 2.11. Let C be a variety of universal algebras, X a subva- 
riety in C ,(I,H,r),e) : C —> X the reflection - inclusion adjunction, 
and F and $ the classes of regular epimorphisms (^surjections) in 
C and X respectively. This Galois structure is finitely complete and 
closed. Moreover, as shown in [34] (in fact in a more general context), 
it is admissible whenever C is congruence modular. 

Example 2.12. Let C be the category of simplicial sets, X the cat- 
egory of (small) groupoids, (I, H,rj,e) : C — > X the fundamental 
groupoid - nerve adjunction, and F and the classes of Kan fixa- 
tions in C and X respectively. The admissibility of this Galois struc- 
ture is used in [8], and the same can be done for many other Quillen 
homotopy structures. 

Most of these and some other examples actually occur in various Galois 
theories mentioned in [4, Appendix A4]. 

We will use the notation of Definition 2.4 for a fixed Galois structure 
T, and for each object C in C, write 

(I c , H c , r] c , e c ) : F(C) -4 $(/(£)) (2.7) 

for the induced adjunction between the category F(C) of fibrations 
over C and the category $(/((?)) of fibrations over 1(C). 

Note that for an object (X,ip) in Q(I(C)) - the (X,ip) - component 
of the counit e c is nothing but the composite involved in Definition 
2.4(f). Therefore T is admissible if and only if this counit is an iso- 
morphism. On the other hand the admissibility condition 2.4(f) well 
agrees with the intuitive idea of connectedness, since applied to the 
map {*} — > 1(A) used in the pullbacks of the form (2.5), it just says 
that the connected components must themselves be connected. 

3 Precategory actions 

From now on we have to assume that the reader is familiar with some 
basic notions of internal category theory, described for instance in [4], 
This should not create problems for “non-experts” since these notions 
are as simple and natural as their well-known special cases, such as 

• internal groups (= group objects) used for example in topology 
and analysis, where internal groups in the category of Hausdorff 
spaces are called topological groups; 
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• internal equivalence relations used for example in Universal Al- 
gebra, where they are called congruences. 

Let C be an internal category in a category C with finite limits and 
I : C — y X an arbitrary functor. If / preserves finite limits, then 
we can say that the image /(C) is an internal category in X; if not, 
then still, there are natural ways to say that /(C) is a “ generalized 
internal category” and to define an internal /(C)-action in X. Trying 
to use the simplest possible structure carried by 1(C ) , one arrives at 
the following 

Definition 3.1. (a) An internal precategory P in a category X is a 
diagram 



P d 

P 2 ™ p 0 ( 3 - 1 ) 



in X with de — 1 = ce, dp = cq, dm — dq , and cm — cp. 

(b) Let P be an internal precategory as above. An internal P-action in 
X is a triple A = (Ao, 7 r, £), in which n : Ao — >• Pq and £ : Pi Xfd^) A 0 — > 
A 0 are morphisms in X in such that all pullbacks along 7r exist (and so 
Pi X(d, 7 r)Ao and all pullbacks below are well-defined), and the diagrams 



P\ ^ (d,7r) Aft 



£ 



Ao 



proj x 



7T 



Pi ~Po 



(3.2) 



c 




148 



G. Janelidze 



< p, q > Xl 



1 X ^ 



< en , 1 > 



^2 X (dg, 7r) ^-0 



m X 1 



Pi X (d,7r) ^0 




(3-3) 

commute. The category of internal P-actions in X will be denoted by 

X p . 

The notation we are using may not seem very natural if one considers 
an internal precategory as a truncated simplicial object; however it 
follows the internal-categorical notation: indeed, when P is an internal 
category, 

• Pi(i = 0,1,2) are its object of objects, object of morphisms, and 
object of composable pairs respectively, 

• d and c the domain and codomain, 

• p and q the pullback projections, 

• e and m the identity and the composition. 



Remark 3.2. If X = Sets, it is convenient to use the following “ 
mixture” of the categorical and simplicial terminology and notation 
for a precategory P above: 

• The system Pi P 0 = (Po, P\,d,c) is called the underlying graph 
of P; accordingly we will write u : x — >■ y and call u a morphism 
from the object x to the object y in P whenever u is an element in 
Pi with d(u) = a and c(u) = b. We will also write e(x) = l x , and call 
(Po, Pi, d, c, e) the underlying reflexive graph of P. 

• The elements of P 2 are called triangles in P, and displayed as 
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whenever p(t) = u, q(t) = v and m(t) = w, if P was a category, then 
the triangles would be the same as composable pairs of morphisms, t 
would be identified with (v, u) and w = vu. 

The P-actions can be described as: 

(a) the diagrams D : (PiDP 0 ) — »■ Sets with D(l x ) = l B (x) for every 
object x and D(v)D(u ) = D(w) for every triangle (2.4); 

(b) the morphisms of precategories from P to Sets; 

(c) the functors from L(P ) to Sets, where L is the left adjoint to the 
inclusion functor from the category of categories to the category of 
precategories; explicitly, L(P ) is generated by the underlying reflexive 
graph of P and the set of identities {m(t) = q(t)p(t)\t G P 2 } (where 
m(t) = q(t)p(t) is of course the same as w = vu in the situation (3.4)). 
Since the definition of a precategory is purely equational with no limits 
involved, it is indeed clear that any functor I : C — >• X carries the 
internal precategories in C into internal precategories in X. Thus, 
if C is an internal category in C, then 1(C) is at least an internal 
precategory in X. Moreover, the following fact is well known: 

Proposition 3.3. In the notation above, 1(C) is an internal category 
whenever the canonical morphisms 

I (Ci X(d, c ) Ci) ->• I (Ci) X(/( d ) ;7 (c)) I (Ci) (3.5) 

I (Ci x ( djC ) Ci x ( djC ) Ci) I (Ci) x pep I (Ci) x (pd), pep I (Ci) (3. 6) 

are isomorphisms. Under this condition, if C is an internal groupoid, 
then so is 1(C). 

The following special case will be important for our purposes: 

Let p : E — > B be a morphism in C, for which there exist the 
pullbacks E x B E, E x B E x B E « (E x B E) x E (E x B E), and 
E x B E x B E x B E Pd (E x B E) x E (E x B E) x E (E x B E). Then 
there is the internal equivalence relation Eq(p) on E corresponding to 
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p (= the kernel pair of p ), and this equivalence relation can be con- 
sidered as an internal groupoid in C; its “ precategory part” displays as 



E x B E x B E 



Ex b E+ E (3.7) 



and Proposition 3.3 gives 



Corollary 3.4. In the notation above, I(Eq(p)) is an internal groupoid 
whenever the canonical morphisms 

I{(E x b E) x B (E Xg E)) — > I(E Xg E) xpE) (E Xg E) (3-8) 

!{{E Xg E) Xg ( E x B E) Xg (E x B E)) — >■ I(E x B E) x^ B ) ( E x B 
E)x m I(Ex B E) (3.9) 

are isomorphisms. 



4 The fundamental theorem of Galois theory 



In this section T = ((/, H, tj, e) : C — > X, F C Mor( C), $ C Mor(X)) 
denotes a fixed Galois structure in the sense of Definition 2.4, and 
p : E ^ B a fibration in C, which we will usually write as ( E,p ) and 
call a fibration over B. 

Consider the following two diagrams: 
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I E 



F(E) 


— - <HW)) 




p\ 


Up)'. 


(4.1) 


F(B) 


— - «(/(B)) 




I B 






H e 






F(E) 


$(/(£)) 




P* 




(4.2) 


F(B) . 


•HUB)) 




H B 






the horizontal arrows here are as 


5 in (2.7), and the vertical 


arrows 



form the appropriate change-of base adjunctions. The diagram (4.1) 
obviously commutes, and since the diagram (4.2) is obtained from 
(4.1) by replacing all arrows with their right adjoints, it commutes 
too, up to a canonical isomorphism. 

Definition 4.1. A fibration ( A , /) over B is said to be 
(a) a trivial covering (of B ), if the diagram 

Va 

A *HI{A) 

f HI(f) (4.3) 

B »HI{B) 

Vb 

is a pullback, or, equivalently, if the morphism 

( V B ) iAJ) :(A,f)^H B I B (A,f) 



is an isomorphism. 



(4.4) 
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(b) split over ( E,p ), if p*(A, f) is a trivial covering of E. 

The categories of trivial coverings of B and of fibrations over B split 
over (E,p) will be denoted by TrivCov(B) and Spl(E,p), respectively. 
In other words Spl(E,p) is defined as the pullback 



s P i(E, P y 

inclusion 

F(B ) 



TrivCov(E) 



inclusion 



(4.5) 



F(E) 



P 



If the ground Galois structure T is admissible (see Definition 2.4(f)), 
then clearly a fibration (A, f ) over B is a trivial covering if and only 
if there exists a fibration ( X , (p ) over 1(B) with 



(AJ)kH b (X,p)- 



that is, the functor H B induces an equivalence 



(4.6) 



TrivCov(B) - $(/(#)) 



(4.7) 

Therefore, under the admissibility condition, the pullback (4.5) can 
be rewritten (up to an equivalence) as 
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Spl(E,p ) $(/(£)) 

H e (4.8) 

F(B) ”F(E) 

p* 



Since the diagram (4.2) commutes, the equivalence (4.7) tells us that 
the functor p* carries trivial coverings to trivial coverings, and that 
TrivCov(B) is obviously contained in Spl(E,p). 

If we ask now, what information on F(B) can be obtained using the 
category X, there is a trivial answer - namely the equivalence (4.7) 
that describes TrivCov(B), but there is also a non-trivial one - namely 
Theorem 4.3 below, which describes the larger category Spl(E,p ) 
whenever (E, p) is a monadic extension in the sense of 

Definition 4.2. We will say that (E, p) is a monadic extension, or that 
p is an effective descent morphism, if the functor p* : F(B ) — > F(E) 
is monadic. 

Theorem 4.3. (The categorical form of the fundamental the- 
orem of Galois theory) . If the ground Galois structure T is admis- 
sible and ( E,p ) is a monadic extension of B, then there is a canonical 
category equivalence 



Spl(E,p) ~ X I{Eg{p)) n 4> 

(19) 

where X 7 ^ 9 ^ n 4> is the full subcategory in X 7 ^ 9 ^ with objects all 
triples A = (Ao, 7r, £) ; in which 7r : Aq — > 1(E) is a fibration. 

The first full proof of this theorem is given only in [32], but as was 
already mentioned, the special case proved in [27] (in detail in [28]) 
is sufficient in order to show that the fundamental theorem of Galois 
theory of commutative rings, whose most general version is due to A. 
R. Magid [55], extends to a purely categorical context. The same is 
shown in [4] on a more elementary level but with some long calculation 
omitted. 

The (internal) precategory I(Eq(p )) should be called the Galois pre- 
category of the extension ( E,p ) and denoted by Gal(E,p). It is a 
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groupoid if the conditions of Corollary 3.4 are satisfied, and if so, then 
it is a group if and only if E is connected, i.e. 1(E) is a terminal object 

in X. 



5 Normal extensions, coverings, and related notions 

Using the notation of the previous section and assuming again T to 
be admissible, let us recall 

Definition 5.1. A fibration (A, f ) over B is said to be a covering, if 
there exists a monadic extension (E,p) of B such that (A, f) is split 
over ( E,p ); if in addition the morphism 1(f) : 1(A) —> 1(B) is an 
isomorphism, then we will say that (A, f) is a connected covering. A 
monadic extension (E,p) of B is said to be 

(a) Cartesian-normal, if it satisfies the conditions of Corollary 3.4, i.e. 
the morphisms (3.8) and (3.9) are isomorphisms; 

(b) normal, if it is split over itself; 

(c) a weakly universal covering, if it is a covering and every covering 
of B is split over it; 

(d) projective, if for every monadic (A, f) extension of B there exists 
a morphism g : E — * A with fg = p\ 

(e) a projective covering, if it is a covering, and for every covering 
(A, f) that is a monadic extension of B, there exists a morphism 
g : E A with fg = p; 

(f) a universal covering, if it is a weakly universal and projective cov- 
ering, and every endomorphism of it is an isomorphism. 

Remark 5.2. The universal coverings in the sense of [4, Definition 
6.6.5(h)] (which was adopted especially for the theory of ordinary cov- 
ering spaces) are not the same as those of Definition 5.1(f) here, but 
they always are weakly universal coverings in the sense of Definition 
5.1(c); the converse needs an additional condition, such as closedness 
under composition of the class of all covering morphisms. 

Clearly every weakly universal covering is normal, and as shown in [30], 
every normal extension is Cartesian-normal - and clearly the converse 
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implications are not true in general. It can also be proved that the 
existence of a weakly universal covering implies that every projective 
covering is weakly universal, and that every connected projective cov- 
ering is universal. These and some other links (see also Section 6) 
between the notions listed in Definition 5.1 in some sense are visible 
already in the following simple cases: 





T as in 
Example 2.5 


T as in 
Example 
2.8 


r as in 
Example 2.9 


T as in 
Example 2.11 




with C = G — Sets, 
where G is a 
group; B = 1 


with C as in the last 
sentence there; 

B the ground field 


with C =Groups and 
X = Abelian Groups; 

B any group; 


Trivial 

coverings 


Trivial G-sets 

i.e. sets equipped 
with the trivial 
action of G 


All 

fibrations 


B-algebras that are 

finite products 
of copies of B 


the extensions of B 

that are obtained as 
pullbacks of 
abelian extensions 
of abelianizations of B 


Monadic 

extensions 


nonempty 

G-sets 


Monadic 

extensions 


Nondegenerated 

S-algebras 


All extensions 
of B 


Coverings 


All G-sets 


All 

fibrations 


Separable S-algebras, 

i.e. finite products of 
separable field 
extensions of S 


Central extensions 
of B 


Connected 

coverings 


Transitive G-sets 
with exactly one 
(nonempty) orbit 


Isomorph. 


Separable field 
extensions of S 


Central extensions 
( E,p ) of B, such that 
the homomorphism 
«!<?,□) : Hi(B,D) 
—> Hi (B, □) is an 
isomorphism 


Cartesian 

normal 

extensions 


Nonempty G-sets, 
in which every 
stabilizer 
is a normal 
subgroup 


All monad, 
extensions 


Nonempty finite products 
of those field extensions 
of S in which the subfield 
of separable elements is a 
Galois extension of S 


All extensions 
of B 


Normal 

extensions 


As above 


As above 


Nonempty finite 

products of 

Galois extensions of S 


Central extensions 
of B 


Weakly 

universal 

coverings 


Nonempty 
simple (=free) 
G-sets 


As above 


Nonempty finite products 
of copies of the separable 
closure of B , provided that 
closure is a finite 
extension of B ; otherwise 
there are no weakly 
universal coverings of B 


Weakly universal 
central extensions 
i.e. weakly initial 
objects in the 
category of central 
extensions of B 


Projective 

coverings 


As above 


Projective 

monadic 

extensions 


As above 


As above 


Projective 

monadic 

extensions 


As above 


As above 


Nonempty finite products 
of copies of the algebraic 
closure of B provided that 
closure is a finite 
extension of B; otherwise 
there are no projective 
monadic extensions of B 


Free extensions, i.e. 
the extensions (E , p) 
of B in which E is a 
free group 


Universal 

coverings 


Simple 

transitive 

G-sets 


Projective 
monadic 
extensions 
all whose 
endom. 
are isom. 


The separable closure of 

B , provided that closure is 

a finite extension of B; 
otherwise there are no 
universal coverings of B 


The universal central 

extension of B 

provided B is perfect, 
i.e. B = [B, B] 



If ( E,p ) and ( E',p ') are fibrations over B and p' factors through p, 
then clearly Spl(E,p ) C Spl{E',p'). Therefore whenever (E,p) is a 
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projective monadic extension of B, the category Spl(E,p ) coincides 
with the category Cov(B) of all coverings of B. Therefore Theorem 
4.3 yields: 

Corollary 5.3. In the assumptions of Theorem 4-3, if (E,p) is pro- 
jective, then there is a canonical category equivalence 
Cov ( B ) ~ X 1 ^)) n $ . (5.1) 

If (E,p) is Cartesian-normal and has the property Spl(E,p ) = Cov(B), 
then the internal groupoid I(Eq(p)) should be called a fundamental 
groupoid of B, and if every morphism in X is a fibration, then it is 
uniquely determined up to the Morita equivalence. However it is de- 
sirable to make it unique up to the internal equivalence, which could 
be achieved by requiring (any of) the additional conditions involved 
in 

Proposition 5.4. Let ( E,p ) and(E',p') be Cartesian-normal monadic 
extensions of B. If any of the following two conditions holds, then the 
Galois groupoids I(Eq(p)) and I{Eq(p')) are equivalent as internal 
groupoids: 

(a) The morphisms p and p' factor through each other; in particular 
this is true if (E,p) and ( E',P ') are both projective; 

(b) Spl(E,p) = Spl(E',p'), every morphism in X is a fibration, and 
the Morita equivalence of internal groupoids in X implies their internal 
equivalence; in particular this implication holds for X = Sets. 



6 Galois theory of commutative rings 

The notion of a Galois extension of a commutative ring with a finite 
Galois group was first introduced by M. Auslander and O. Goldmann 
[1], and then several equivalent definitions were found and the funda- 
mental theorem of Galois theory for these extensions was proved by S. 
U. Chase, D. K. Harrison, and A. Rosenberg [15]. The separable clo- 
sure of a connected commutative ring was constructed by G. J. Janusz 
[51], and its abelian version by D. K. Harrision [25]. Galois groupoids 
in connection with idempotents were introduced by O. Villamayor and 
D. Zelinsky, first in the finite and then in the infinite case - as the ti- 
tles of [63] and [64] show. Not mentioning many further developments, 
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such as connections with A. Grothendieck’s work [24] on etale cover- 
ings (that work, where in some sense Galois theory of commutative 
rings was already discovered, was in fact done long before it was pub- 
lished), Galois descent and cohomology, non-commutative separable 
algebras and Chase-Sweedler’s Hopf Galois extension, Harrison func- 
tor and generalized Kummer theory, and separable polynomials, and 
restricting our attention to Galois correspondences and related cate- 
gory equivalences, we still have to mention three important books: 

(i) S. I. Chase and M. E. Sweedler [16] consider the finite Galois exten- 
sion of commutative rings as a special case of what they call (coflat) 
Galois objects in a category with finite products and coequalizers. 
A general theory of Galois objects is developed in the first chapter, 
written by S. U. Chase. Among other things it is shown that in the 
opposite category of commutative rings every Galois object is coflat; 
however the authors do not make the important conclusion that they 
could use torsors instead of the Galois objects (the reader should not 
consider this as a critical remark: note that the book was written more 
than 30 years ago!). 

(ii) F. R. DeMeyer and E. Ingraham’s book [18] is mostly “ non- 
commutative”, but it has a chapter on Galois theory of connected 
commutative rings. 

(iii) A. R. Magid wrote several papers on the extension of Galois theory 
of separable algebras from the connected case to the case of arbitrary 
commutative rings, and completed that work in his book [55]. Magid’s 
Galois theory contains the Villamayor- Zelinsky case [64], and in fact 
it is the final step in generalizing the separable Galois theory of com- 
mutative rings. Magid’s notion of a componentially locally strongly 
separable algebra perfectly combines the following two possible ways of 
involving infinity: on the one hand such an algebra may have infinitely 
many idempotents, and on the other hand its Boolean localizations (in 
the sense of Pierce spectrum - see [59]) may themselves be an infinite 
extension of the corresponding Boolean localizations of the ground 
ring. 

Let us now briefly recall the main known definitions, constructions, 
and results that are needed for or/and belong to Magid’s theory; all 
rings and algebras below are supposed to be commutative (and with 
a unit, and the homomorphisms are supposed to preserve it), and R 
always denotes a ring. 
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6.1. (a) Let Idemp(R) be the Boolean algebra of idempotents in R. 
For each maximal ideal x in Idemp(R) the set Rx = {re|r G R, e G 
x } is an ideal in R and the quotient ring R x = R\Rx is called the 
Boolean localization of R at x. The ring R x is always connected, 
i.e. Idemp(R x ) = {0, 1}. If A is an i?-module, or an iTalgebra, 
then A x is still defined as A x = A\Ax ~ A 0# R x . Given a ring 
homomorphism / : R — >■ S and a y in Idemp(S) , we will write y n 
R = {e G Idemp(R)\f(e) G y} and, regarding S as an irbalgebra, 
also regard S y as an i? y n«-algebra; clearly the S y is one of Boolean 
localizations of S yri R- 

(b) An i?-algebra S is said to be: 

(bi) separable, if it is projective regarded as an S ®r S - module, 

(b 2 ) strongly separable, if it is separable and projective as an R- 
module, and the canonical homomorphism R — > S is injective; 

(b 3 ) locally strongly separable, if every finite subset of it is contained 
in a strongly separable subalgebra; 

(h> 4 ) componentially locally strongly separable, if the canonical homo- 
morphism R — > S is injective, and S x is a locally strongly separable 
ZG-algebra for every x G Idemp(R), or equivalently, S y is a strongly 
separable i^nfl-algebra for every y G Idemp(S); 

(bs) the separable closure of R, if it is componentially locally strongly 
separable, and has the following two properties inside of the category 
of componentially locally strongly separable iTalgebras: every object 
has a morphism into it, and every morphism from it is an injective 
homomorphism; 

(b 6 ) normal, if it is componentially locally strongly separable, and for 
every x G Idemp(R), every two Abalgebra homomorphisms from S to 
the separable closure of R x have the same image. 

(c) For a normal iTalgebra S the objects of the Galois groupoid 
Gal{S/R) are the maximal ideals of the Boolean algebra Idemp(S), 
and a morphism in Gal(S/R) from x to y is an i?-algebra (iso) morphism 
from S x to S y (although there is also a good reason to make it a mor- 
phism in the opposite direction). The morphisms in Gal(S/R ) can 
also be described as maximal ideals in Idemp{S ®r S ), which helps 
to make Gal(S/R) an internal groupoid in the category of profinite 
topological spaces. As follows from the results of [27] (the detailed 
proofs are in [28]), an i?-algebra S is normal if and only if it is normal 
in the sense of Definition 5.1(b) with respect to the Galois structure 
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described in Example 2.10(a), and its profinite Galois groupoid is the 
same as the I(Eq(p )) considered in the previous two sections, where 
p : E — > B is the canonical homomorphism from R to S regarded as 
a morphism S — > R in the opposite category of commutative rings, 
(d) A.R. Magid [55] proves that every ring R admits a unique separable 
closure R, and calls its Galois groupoid Gal(R/R ) the fundamental 
groupoid of R. One of his important discoveries is that the Boolean 
algebra Idemp(R) is larger than the Boolean algebra Idemp(R), and 
in fact it is the completion of Idemp(R). The Main Theorem IV. 31 in 
[55] asserts that there is a category equivalence 
(Componentially Locally Strongly Separable f?-algebras) op ~ 
(Profinite Spaces) Gal ( R l R )p) (Epimorphisms of profinite spaces), 
( 6 . 1 ) 

in the notation used in previous sections. The epimorphisms in the 
category of profinite spaces are of course just (continuous) surjections, 
but moreover, as explained in [55], the space of objects of Gal(R/R ) 
is extremely disconnected (which is the same as to say that the corre- 
sponding Boolean algebra Idemp(R) is complete), which makes every 
epimorphism into it a split epimorphism. 

After this we list the main additional observation brought by categor- 
ical Galois Theory: 

6.2 (a) Let T = ((I,H,r],e) : C X,F = Mor{ C),$ = Mor(X)) 
be the Galois structure described in Example 2.10(a). A fibration 
(A, /) over B is nothing but a (commutative unitary) R-algebra, i.e. 
B and (A, /) play the same roles as R and S' in 6.1. As follows from 
the results of [28], (A, f) splits over ( E,p ) in the sense of Definition 
4.1(b) if and only if for every x e I(E ® B A) the Boolean localization 
(E ®b A) x is isomorphic to E xnB • In fact this easily follows from the 
results of R. S. Pierce [59] (recall that E ® B A is the same as the 
pullback E x B A in C). 

(b) The characterization (a) of split objects helps to prove that the 
following conditions on (A, f) are equivalent: 

(bi) (A, /) is a covering in the sense of Definition 5.1; 

(b 2 ) (A, /) is split over the separable closure of B; 

(bs) for x £ 1(B), A x is a locally strongly separable R^-algebra. 

(c) The crucial part of the proof of (b) is to show that every strongly 
separable algebra is split over some monadic extension. This was done 
in [28] using the finite Galois theory of [15] (and probably known to A. 
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Grothendieck before); however as shown later in [9], even that can be 
done categorically. Note also that the categorical notion of separability 
used in [9] in fact applies only to the “ finite” situations of Example 
2.8. The general notion of separability was introduced and studied in 
[47], and of strong separability in [49]. 

(d) As follows from (b), the Magid’s equivalence (6.1) is a part of a 
larger equivalence 

(R-algebras satisfying (b 3 )) op ~ (Profinite Spaces ^ Gal (R/R) (6.2) 
provided by categorical Galois theory (see Theorem 3.3). Moreover, as 
also shown in [28], there is a categorical modification, which directly 
gives (5.1). Still, this brings the question for the existence of a class of 
morphisms in C = (Commutative Rings) op , which declared as the 
class of fibrations in C, would form a Galois structure with the class 
of epimorphisms in X = Profinite Spaces - and then makes (5.1) a 
special case of Theorem 4.3. The answer is affirmative, and there are 
several possibilities including faithfully flat extensions (see [50]). 

(e) The separable closure in the sense of A. R. Magid [55] is a non- 
connected universal covering in the sense of Definition 5.1 (see 6.1(d)), 
and no other example like that among the known concrete Galois 
theories has been described so far. 

(f) In the last (fifth) chapter of [55] the equivalence (5.1) is used to 
describe the Galois correspondence for commutative rings, which ex- 
tends the classical formulation of the fundamental theorem of Galois 
theory. This result has been improved using the categorical approach 
in [12]. 

(g) As shown in [35] the coverings of commutative rings satisfy the 
categorical conditions that make them a part of a factorization system, 
generalizing the (purely inseparable, separable) factorization for the 
finite field extensions - which however satisfy much stronger conditions 
of [11]. The problem of describing “ the other part” is far from being 
solved, but some related results are obtained in [47] and [48]. 

7 Central extensions of universal algebras 

Let T = ((I,H,ri,e) : C ->■ X, F C Mor( C),$ C Mor(X)) be the 
Galois structure described in Example 2.11. As we already mentioned, 
in this case the coverings become central extensions in the sense of [34], 
and if C and X were the varieties of groups and of abelian groups, 
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respectively, then a fibration (i.e. an extension) (A, /) would be a 
covering if and only if it is a central extension in the usual sense, i.e. 
/ is a surjective homomorphism whose kernel is contained in the cen- 
tre of A. As shown in [34], the same is true if C was any variety of 
fl-groups, X a subvariety in C, and the “ usual” central extensions 
are defined as in A.S.-T. Lue’s paper [53] (earlier a special case was 
studied by A. Frohlich [20], and after Lue these central extension were 
also studied by J. Furtado-Coelho [21]). On the other hand, as sug- 
gested by Commutator theory (see also 8.3 below) developed in uni- 
versal algebra, an extension (A, f) should have been called central if 
[A x A, A x B A] = 1a, i-e. if the commutator of the largest congruence 
on A with the kernel-congruence of / is trivial (the universal alge- 
braists would write [1,0;] = 0, where a denotes the kernel-congruence 
of /; they would also say that 1 and a centralize each other) - such 
extensions were called algebraically central in [36] and [37], in distinc- 
tion from the central extensions introduced in [34], which hence were 
called categorically central. Although neither Frohlich and his follow- 
ers, nor universal algebraists seemed to be interested to compare their 
notions of central extension with each other, or with the one intro- 
duced by S.A. Huq [26], the comparison with the categorically central 
extensions has been made in [36] and [37], and then by D. Bourn and 
M. Gran [6] and M. Gran alone in [22] and [23]. The results in fact 
came rapidly one after another as shown in the following: 

Theorem 7.1. List of Comparison 

Since the algebraically central extensions are defined using a com- 
mutator, and the most general case where all known definitions of 
commutator are known to be equivalent is the case of congruence 
modular varieties, we restrict ourselves to that case, i.e. we assume C 
to be congruence modular. Another reason for this restriction is the 
admissibility mentioned in Example 2.11. The algebraically central 
extensions should coincide with the categorically central ones if we 
take X to be the subvariety of abelian algebras in C, i.e. the algebras 
A with the property [A x A, A x B A\ = 1 a- The fact that every cate- 
gorically central extension is algebraically central, was proved in [36], 
and the converse was proved 

(a) for varieties of Q-groups in [36]. 

(b) then for Mal’tsev (=congruence permutable) varieties in [37], 
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(c) and finally for congruence modular varieties in [23]. 

Since the categorically central extensions were actually studied in [34] 
in the context of Barr exact categories (which is more general than the 
context of varieties), and since there is a categorical notion of com- 
mutator introduced by M. C. Pedicchio [58] for Barr exact Mal’tsev 
categories (which were introduced by A. Carboni, J. Lambek and M. 
C. Pedicchio [14]; see also A. Carboni, G. M. Kelly and M. C. Pedic- 
chio [13]), the two notions of central extensions can also be considered 
in any Barr exact Mal’tsev category. The fact that they coincide was 
proved 

(d) for semi-abelian categories (in the sense of [39]); in fact mainly the 
Bourn protomodularity [5] was used) in [6], 

(e) and then for Barr exact Mal’tsev categories in [22]. 

The situation with Huq’s notion of centrality is more complicated; it 
agrees with the others in “ nice” semi-abelian varieties, such as the 
varieties of groups or of rings, but not in general (see [7]). 

In the notation above, let us now assume for simplicity that C is 
a Mal’tsev variety, X any subvariety in C, B an object in C, and 
consider the question of existence of a universal covering of B. 

Lemma 7.2. Every fibration in C is a Cartesian-normal (monadic) 
extension, and every covering is a normal extension. 

Proof: The pullbacks involved in Corollary 3.4 are pullbacks of split 
epimorphisms, and therefore the functor I preserves them by [22, 
Proposition 2.1] - which proves the first assertion; the second one is 
proved in [34] (in fact in the more general case of Goursat varieties). 
Thus, if ( E,p ) is a fibration over B with free (or, more generally, 
projective) E then it yields a good way to define the fundamental 
groupoid of B , namely as I(Eq(p)) - and it is determined uniquely up 
to an internal equivalence by Proposition 5.4. Let us hence fix such 
an ( E,p ), and write 

P ) d ^ 

I(Eq(p)) = n(B) = (Il(B) 2 n (B), U(B) 0 ) (7.1) 

q ^ c ^ 

for the fundamental groupoid. The equivalence (4.1) then becomes 
Cov(B) ~ X n(B) n $. (7.2) 
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Considering groupoid actions as a special case of monoid actions in 
the appropriate monoidal category of spans, we can speak of n(£?) 
acting on itself via the multiplication, and write that object of X n ( B ) 
as (II(£)i,c, m). Moreover, since according to the construction of 
n(B), the morphism c is obviously a fibration, (II(B) 1 ,c,m) belongs 
to X n W n $ and therefore has the corresponding object (E^,p^) in 
Cov(B). 

Lemma 7.3. The fibration (E$,pfl) above is a projective weakly uni- 
versal covering. 

Proof: Since every fibration in C is a monadic extension, so is (E\ p$). 
The projectivity follows then from the equivalence (7.2) and the fact 
that (U(B)i,c,m) is a free II (B) - action. Furthermore, the projec- 
tivity implies weak universality by the second assertion of Lemma 7.2 
(or using the results of [34] directly). 

After that as an easy exercise with internal groupoid actions using the 
fact that the diagram 



I(E x B E) 



rightarrow 



1(E) -► 1(B) 



is a coequalizer diagram, we obtain 



(7.3) 



Theorem 7.4. The following conditions on an object B in C are 
equivalent: 

(a) B admits a connected universal covering; 

(b) the object I(B ) is projective in X; 

(c) the fundamental groupoid E1(.B) is internally equivalent to an in- 
ternal groupoid in which every morphism is an automorphism. 

Remark 7.5. (a) The covering (E\p^) considered above is nothing 
but the reflection of ( E,p ) into the category of coverings in the sense 
of [35], and, if X was the category of abelian objects in C, then due 
to the connection with the algebraically central extensions, it could 
be described as 

E^ = [E x E, E x B E] with the induced p $ - see [37]. 

(b) If C had a zero object (which is equivalent to the requirement that 
every algebra has a unique one-element subalgebra), and 1(B) was the 
trivial algebra, then the universal covering of B would be the initial 
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object in the category Cov(B), and that is what is usually called the 
universal central extension. In the case of D-groups the existence of 
universal central extensions is a part of what A. Frohlich’s school calls 
theory of Baer invariants; we already mentioned the papers [20], [53], 
[21], but much deeper links are to be established. 

8 Some open questions 

8.1 The classification theorem for covering spaces of a “ good” space is 
shown in detail in [4] to be a special case of what is presented here as 
Corollary 5.3 of Theorem 4.3. Does it mean that the notion of Galois 
structure is rich enough to be used for developing “ abstract homo- 
topy theory” = “ higher dimensional Galois theory” to be applicable 
to other examples of categorical Galois theory? If so, this approach 
should be in the same relationship with D. Quillen’s approach [60], as 
the general adjunctions with the adjunctions formed by the functors 
(2.2) and (2.4). One possible way of developing higher dimensional 
Galois theory is indicated by a very special example of so-called dou- 
ble central extensions of groups [31], further extended in [33]. 

8.2 A further generalization of Galois theory to so-called variable cate- 
gories is proposed in [41] and then an intermediate case studied in [44]. 
After that A. Joyal - M. Tierney’s theorem on presentations of geomet- 
ric morphisms of toposes [52] and the Tannaka duality become parts 
of (generalized) categorical Galois theory. However it seems that they 
should even be special cases of the “ straightforward” two-dimensional 
version of the very special situation studied in [11] and in [38]. 

8.3 The universal-algebraic notion of commutator (of two congruences) 
was invented by J.D.H. Smith [62] for Mal’tsev varieties, and then 
generalized in various directions by various authors, although all defi- 
nitions agree in the congruence modular case; some relevant references 
are given in [40] , where the commutators are described via internal cat- 
egorical structures ( “pseudogrupoids” - rather than “ pregroupoids” 
used in [58] in [58]). As explained in Section 6 above the notion of 
commutator theory perfectly agrees with the categorical notion of cen- 
tral extension, at least in the congruence nodular case. However this 
comparison makes sense only in a very special case of X being the 
category of abelian objects in C (see 7.1). Is it possible to define the 
notion of commutator with respect to any subvariety/" good” reflec- 
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tive subcategory X? This should probably be related with the theory 
of Bear invariants (see the references in 7.5 (b)). 

8.4 The existence of a universal covering proved in Section 6 for the 
Galois structures in Mal’tsev varieties, and in fact proved in [55] for 
the Galois theory of commutative rings, is also known in many cases, 
and the proof is especially simple if X = Sets. But what is the most 
appropriate general context for such a proof? 

8.5 Is there a convenient admissible Galois structure, large enough to 
make Magid’s equivalence (5.1) and the similar Grothendieck’s equiv- 
alence for etale coverings of schemes from [24] subequivalences of a 
larger equivalence? A good answer to this question will probably in- 
volve an infinite version of M. Barr’s Galois theory of commutative 
rings ([2], [3]) in a topos. It seems that some progress in this direction 
has already been made by B. Mesablishvili. 

8.6 Unlike the case of commutative rings, where Magid’s Galois theory 
is obviously “ the right one” , it is not at all clear what is the right level 
of generality for the theory of covering spaces. For instance, is it possi- 
ble to put together naturally (like in 8.5 above?) the locally connected 
and the compact case? Recall that these two cases are described in 
[4] (for the second one see also [11]) in the categorical language. Note 
also that a generalized theory of covering spaces may need further in- 
vestigations in topological descent theory, and especially of the etale 
descent, and we recall: 

(a) The complete description of effective descent morphisms (=monad- 
ic extensions) of topological spaces, was obtained by J. Reiterman 
and W. Tholen [61] in terms of ultrafilter convergence structure. This 
result is mentioned in [45], which together with [46] and [50] provides 
a general exposition of descent theory. 

(b) Examining the simple case of finite topological spaces (=finite 
preorders), a link with old Grothendieck - Giraud type results was 
established [42], and the effective etale descent morphisms of finite 
topological spaces were characterized [43]. There is a hope - thanks 
also to the recent work of M. M. Clementino, D. Hofmann, and W. 
Tholen - to generalize these results back to infinite spaces. 

8.7 Answering a question of S. MacLane, the relationship between 
the notion of Picard - Vessiot extension and the categorical Galois 
theory was established in [29], but no further investigations towards 
categorical Galois theory of differential rings/fields have been made, 
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and no connection with P. Deligne’s work [17] has been established. An 
observation about that is made by A. R. Magid in [56]. For instance 
I do not know the answer to E. Kolchin’s question: can the strongly 
normal extensions of differential fields (in the sense of E. Kolchin) be 
described categorically? 
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The Polarity between Approximation 
and Distribution 

M. Erne 



Abstract 

We provide a general framework for the sporadic observation that certain 
distributive laws for standard completions are equivalent to certain approx- 
imation properties of the underlying ordered sets, as known from the theory 
of continuous posets (domains). 

Every closure operation 8 on a locale (frame) L with join-dense range 
X gives rise to a polarity (Galois connection) induced by the relation 
q — {{y,z) : z < y => 8z < y}. Using that polarity, we show that the 
sublocales of L containing X are precisely the systems Z 6 of all Z-^-closed 
elements y (satisfying 8z < y whenever z £ Z and z < y) for so-called 
locally approximating subsets of L, i. e. subsets Z D X such that for all 
x £ X and z £ Z with x < 8z , there is a w £ Z with w < z and x = 8w. 
Thus, the complete lattice of all sublocales containing X is dually isomor- 
phic to the complete lattice of all Galois-closed locally approximating sets. 
Similarly, we show that in the case of a completely distributive lattice L, 
those meet-closed subsets which are completely distributive and contain 
X are precisely the systems of Z-8-closed elements for so-called strongly 
approximating subsets Z. 

These two polarity theorems have manifold applications in order theory 
and related fields, for example in domain theory and pointfree topology. 

Mathematics Subject Classification : 06A15 , 06B35 , 06D10 , 06D22. 

Key words: Approximation , Interpolation , Closure , Distributive , Continu- 
ous poset , Frame , Galois connection , Locale , Nucleus , Sublocale. 
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0 Introduction 

Approximation, interpolation and distribution are central notions of 
applied mathematics. In order to prevent misunderstandings a pri- 
ori , it should be pointed out that in the context we are concerned 
with, the use of these terms is rather general and perhaps a bit vague 
from the perspective of applied mathematics. However, theoretical 
computer science and other mathematically founded disciplines use 
ordered structures as modelling hierarchies of information, represen- 
tation, computation etc. In that context, certain auxiliary relations -< 
play a basic role, and it is a custom to call them approximating if each 
object is the join (supremum, least upper bound) or “limit” of all ele- 
ments preceding it with respect to -< (see, e.g., [18]). It is known from 
lattice theory, domain theory and related fields that such an approx- 
imation property is often equivalent to certain (infinite) distributive 
laws for the underlying structure or one of its completions. 

Notational Remark. Given an arbitrary relation -<, we shall use the 
abbreviations 

x^. = {y : x -< y} , X~< = {y : 3x e X(x -< y)} , 

-<z = {y : y -< z}, -<Z = {y : 3z G Z (y -< z)} . 

If the relation -< is idempotent, i. e. transitive and interpolating (the 
latter meaning that x -< z implies x < y < z for some y) then, by 
an old result due to Raney [23], the complete lattice of all -<-saturated 
sets U= U~< is completely distributive, i.e., arbitrary joins distribute 
over arbitrary meets. A frequently observed phenomenon is that “ap- 
proximation implies interpolation”. (We apologize for the “abuse” of 
terms that often have a specified meaning in applied mathematics.) 

To record two typical examples, let us recall that a complete lattice L 
is completely distributive iff L is supercontinous, that is, the relation < 
defined by 

x< y 44- y <\JW implies x < w for some w e W 

(where, as usual, V denotes the join) has the approximation property 
and, as a consequence, the interpolation property (see [23]). Equiv- 
alently, for each element x of L, there is a least downset (lower set, 
decreasing set, order ideal) whose join dominates x. Similarly, arbi- 
trary meets distribute over directed joins iff the way-below relation <C, 
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given by 

t<!/ y < V W implies x < \J F for some F C. U W 

(where F C u W means that F is a finite subset of W), has the ap- 
proximation and consequently the interpolation property [18]. In other 
words, L is a continuous lattice in the sense of Scott [24], Perhaps even 
more interesting is the fact that a dcpo or domain (a poset in which all 
directed subsets have joins) is continuous ( that is, for each element 
there is a least directed downset whose join dominates that element) 
iff the Scott topology (equivalently, the system of Scott-closed sets) 
is completely distributive (see [18, 21]). The Scott-open sets are then 
precisely those of the form £/<C. 

A similar characterization holds for lattices with completely distribu- 
tive ideal lattices. Indeed, a much more general theory has been 
developed for so-called Z- distributive and Z-continuous lattices and 
posets (see, for example, [4, 5, 7, 12, 22, 25]). Here, instead of arbi- 
trary, directed or finite subsets, one considers a general system Z (in 
[5, 7, 8, 14] denoted by M) of subsets of a poset P. The Z-join ideals, 
i. e., those downsets that contain with any member of Z its join (pro- 
vided it exists) form a closure system Z y . On the other hand, passing 
from Z to the system Z A of all Z-downsets 

\.Z = {y : y < z for some z € Z} 

where Z is a member of Z or a singleton {a:}, one may assume that 
Z is a standard extension, i. e. a collection of downsets including at 
least all principal ideals 

iy = {x : x < y} (ye P). 

Moreover, there is no essential loss of generality in assuming that all 
members of Z have joins - in other words, that the underlying poset 
is Z-complete. Then, it turns out that Z y is completely distributive 
whenever Z is strongly approximating, meaning that the Z -below ideals 

Vzy = r\{ZeZ*:y<\/Z} 

belong to and, moreover, the Z-below relation, defined by 
x y x e Ij - z y, 

has both the approximation and the interpolation property. Under 
rather mild hypotheses, the converse implication is true as well. (Con- 
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suit the aforementioned references, in particular [5] and [12], for de- 
tails.) 

Often, one has to work with ordered structures possessing weaker local 
approximation properties. For example, the ideal lattice of a join- 
semilattice S is known to be distributive (and even a frame) iff for 
all finite subsets Z and all x £ S with x < \J Z, there is a finite 
W C \.Z with x = \J W (cf. [8, 19]). Or, the Scott topology of a dcpo 
is a dual frame iff the corresponding condition is fulfilled for directed 
instead of finite subsets (for complete lattices, this is equivalent to 
meet- continuity ; cf. [14, 18]). More generally, it was shown in [8] that 
the Z- join ideals of a ^-complete poset P form a frame whenever Z 
is locally approximating , that is, 

Z £Z and x<\J Z imply x = \jW for some W £ Z with W C | Z. 

Again, under suitable weak assumptions on Z (for example, that Z h 
be closed under binary intersections), the converse also holds true. 
A recent result established in [14] is that all T 0 closure systems that 
are frames arise as Z- join ideal systems from locally approximating 
standard extensions. Similarly, we shall demonstrate in the present 
note that all completely distributive T 0 closure systems come from 
strongly approximating standard extensions in that manner. 

An appropriate and elegant concept for these investigations is that of 
Galois connections and, in particular, polarities in the sense of Birkhoff 
[3]. For surveys on that topic, see [15] and the article Adjunctions and 
Galois Connections: Origins, History and Development in this volume. 

Recall that polarities are pairs ($, 4/) of antitone (inclusion inverting) 
maps between power sets such that the two possible compositions are 
extensive. Any such Galois connection between power sets comes from 
a unique relation g between the underlying sets A and B such that 

<F(T) = Y e = {z £ B : y g z for all y £ Y} (Y C A) , 

\&(Z) = Z Q = {y £ A : y g z for all z G Z} (Z C B) . 

In the present situation, the polarity results from a relation canonically 
associated with a given closure operation. Often, this operation is the 
cut operator A of a poset P = ( X , <), assigning to each subset Y of 
X the intersection of all principal ideals containing Y. The crucial 
relation is then defined by 
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Y g Z Z C Y implies A Z C Y. 

One partner of the induced polarity, restricted to the completely dis- 
tributive lattice C of all downsets of P, maps any collection Z of 
downsets to the closure system 

Z A = {Y e C-.YgZ for all Z G Z} = {Y £ C : A Z Y C Y}, 

where 

A Z Y = U {AZ :ZeZ,ZCY}. 

The members of Z A are called Z-A-ideals. They coincide with the 
Z-join ideals provided P was 2-complete (see [10, 11, 12]). The other 
partner of the polarity sends any system y of downsets to the system 

y e = {Z e C : Y g Z for all Y e T} = {Z £ C : AZ C T y Z}, 

where 

T y Z = f]{Y ey : Z CY}. 

A standard extension Z is then Galois closed with respect to the above 
polarity iff Z = (Z A ) e , that is, Z 6 C and AZ C T z aZ imply Z £ 
Z. 

Our main result in that context will be that under the above polarity, 
those standard completions which are frames bijectively correspond to 
the Galois-closed, locally approximating standard extensions, and the 
completely distributive standard completions are in one-to-one cor- 
respondence with the Galois-closed, strongly approximating standard 
extensions. 

With regard to topological aspects, the local and global approximation 
properties we shall discuss might also be referred to as local or global 
density properties, respectively. 



1 The Polarity Between Sublocales and Locally 
Approximating Sets 

We are now going to study such polarities in a more general “frame- 
work”, namely that of arbitrary frames (locales) instead of lattices 
of downsets. A frame or locale is a complete lattice satisfying the 
distributive law 
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x A V Y = V(z A Y) (x G L, Y C L) 

where x A Y is a shorthand notation for {x A y : y G Y}. 

From the viewpoint of Galois connections, it is worthwile to note that 
a complete lattice L is a frame iff for each x G L, the unary meet 
operation z i — > x A z has a right adjoint, sending y to the relative 
pseudocomplement or residual 

x^-y = y:x = \/{z G L : x A z < y}. 

A subframe of L is closed under arbitrary joins and finite meets, 
whereas a sublocale Y of L is closed under arbitrary meets ( f\-closed ) 
and under residuation, the latter meaning that y : x lies in Y for all x G 
L and y G Y. For more background about frames, see, for example, 
[ 20 ]. 

Henceforth, we are fixing 

- a frame L with order relation <, 

- a closure operation 5 on L, and 

- a subset X of the range L s = 5[L\ that is join-dense in L. 

Thus, each element of L is a join of elements from X. For later use, 
we note the known fact that a join-dense subset X of L is /\-closed 
in L (that is, for each subset of X, the meet formed in L belongs to 
X) if and only if X is a complete lattice with respect to the induced 
order. It is an experience in order and category theory that often 
the reduction to join-dense subsets simplifies constructions and proofs. 
This will be a guiding principle in the subsequent considerations. Most 
of the definitions will involve the closure operation 8, and some of 
them also the join-dense subset X, although that dependence will not 
be mentioned explicitly. 

It will be convenient to write 8x for 8(x) etc. Recall the most succinct 
characterization of closure operations by the equivalence 

x < Sy 8x < 5y. 

Every subset Y of L gives rise to a closure operation 7 y defined by 
7 yx = f\{y eY :x <y}, 

and Y coincides with L lY iff Y is /\-closed in L. More generally, for 
every preclosure operation , that is, for every map (3 : L — > L satisfy- 
ing 
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x < y => (3x < (3y and x < (3x, 
the set 

L 0 = {x E L : (3x = x} 

of all fixpoints is /\-closed, and the corresponding closure operation 
7^/3 may be constructed by transfinite iteration of f3, setting 

= (3, f3 K+1 — (3 o /T, and (3 X x = \J{(3 K x : k < A} 

for limit ordinals A . By set-theoretical restrictions, there is a first 
ordinal k such that (3 = j3 , and this map will be denoted by (3. 
It is the smallest closure operation that dominates (3 (pointwise) and 
coincides, therefore, with the closure operation of L 13 = I/. In the 
case of (pre)closure operations on power sets, we shall speak, as usual, 
of (pre) closure operators , and /\-closed subsets of power sets VA are 
referred to as closure systems on A. (Often, that terminology is also 
used in the general context of complete lattices.) 

Given a fixed subset Z of L with X C Z, we frequently shall work 
with the preclosure operation 

S z : L — > L, y i — >\J{5z : z 6 Z,z < y}. 

Its fixpoints, referred to as Z-5- closed elements, form a /\-closed subset 

Z s = {y G L : z € Z and z < y imply 5z < y}, 

and the corresponding closure operation is 8 Z = Without partic- 
ular emphasis, we shall make use of the inequalities 

& Z y < ( 8 z ) K y < 5 z y < 5y for all y E L 

and the equalities 

5 z z = (5 z y z = 8 z z = 5z for all z £ Z. 

A fundamental ingredient for our investigations will be the relation 

q = {{y, z) G L x L : z < y => 5z <y}. 

To capture the dependence on S, the letter g may be read as an “upside 
down . The induced polarity consists of the two maps 

Y i — > Y e = {z £ L : Vy 6 Y (y g z)} = {z € L : 5z < 7 yz} and 

Z 1 — > Z s = {y G L \ 'dz € Z (y g z)} = {y G L : S z y < y} . 

Observe that the other inequality y < 5 z y holds by join-density of Z D 
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X, and that the inequality qyz < 8z holds whenever L s is contained 
in Y. In that situation, one has 

Y e = {z £ L : 8z = 7 Y z} . 

In particular, 

L s = L e . 

One of the two possible compositions gives the Galois closure 
Z & e = {zeL: 8z = 8 z z}, 

and Z is Galois closed if Z = Z Sq , which means that 8z = 8 z z implies 
z G Z. By the usual rules for Galois connections, 

Y CZ S Z CY S , 

each set of the form Y e is Galois closed, and if Z is Galois closed then 

Y = Z 6 =7 z = Yr 

Notice that Z 5 is always /\-closed and contains L s , whereas the ex- 
ample below shows that Galois-closed subsets need not even be A- 
subsemilattices (but see Lemmas 6 and 12). 

Example 1. Let D n denote the lattice of all divisors of the natural 
number n, and consider the downset frame L of D^. (In the present 
context, a downset contains with any number all divisors of it). The 
principal ideals of _D 36 (in other words, the divisor lattices D n for 
n | 36) together with the set {1,2,3} form a closure system Y of 
downsets, hence a /\-closed subset of L. Denote the corresponding 
closure operator by 7 and the restriction of the cut operator of Z) 36 to 
L by 8. Then, with respect to the polarity associated with <5, we have 
a Galois-closed set 

Z = Y e = {z e L : jz = 8z } . 

The downsets a = {1,2,3, 4} and b = {1,2, 3, 9} belong to Z since 
7 a — 8a = D \2 and jb = 8b = Di$, but a (lb is not in Z because 
7 (a fi b) = {1, 2, 3} / £>6 = 8 (a fl b ). Hence, Z is Galois-closed but 
not fl-closed. 

However, as in every Galois connection, the Galois-closed sets arising 
from any closure operation 8 via the relation q do form a closure 
system. 
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Henceforth, we are fixing a set Z with X C Z C L. First, we list 
some necessary and sufficient conditions ensuring that S preserves fi- 
nite meets of elements from Z (cf. [8] for the case of set-theoretical 
closure operators). Denote by A Z the A-subsemilattice of L generated 
by Z. 

Lemma 1. The following statements are equivalent: 

(a) y E L, z E Z and y < 5z imply y < S(y A z). 

(b) x EX, z E Z and x < 5z imply x = 8(x A z). 

(c) y A 5z\A ... A Sz n < S(y A Z\ A ...A z n ) for yEL and ZiEZ. 

(d) 5 preserves finite meets of elements from Z. 

(e) 5 induces a A-homomorphism from A Z to L 5 . 

Proof. (a)=>(b) is clear by the inclusion X CL 5 , which means x = 6x 
for all xEX. 

(b)=>-(c) is shown by induction. For n = 1, use join-density of X 
and the hypothesis that for x E X, the inequality x < y A 8z implies 
x = S(x A z) < S(y A z). 

Assume the claim has been verified for n. Then, for z n+ \ E Z, the 
previous argument and the induction hypothesis give 

y A bz\ A ... A 5z n A 5z n+ 1 < 8(y A 5z\ A ... A Sz n A z n+ 1 ) 

— A Z\ A ... A z n A Zn-^-i') A Z\ A ... A 

The implications (c) (d) (e) (a) are straightforward. □ 

By a pseudonucleus on L, we mean a preclosure operation f3 : L — > L 
such that 

x A y < z = /3z implies x A (dy < z , 
which is tantamount to the inequality 
x A fdy < f3(x A y ) , 

because (5{x Ay) is the smallest z with x Ay < z = (3z. A prenucleus 
(cf. [1]) is a preclosure operation f3 satisfying the slightly stronger 
condition 

x A(5y < (3{xAy), 

and a nucleus is a closure operation that preserves binary (hence fini- 
tary) meets. In the implication chain 
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nucleus 



$ 

extensive A-homomorphism 
prenucleus 
pseudonucleus 
preclosure operation 



none of the arrows may be inverted in general. However, an idem- 
potent pseudonucleus (3 is already a nucleus, since (3 = (3 entails 
x A (3y < (3{x A y) and then, replacing x with (3x, 

(3x A (3y < (3((3x Ay) < f3(3{x Ay) = /3(x Ay) < f3x A (3y. 

Next, we improve two known elementary facts, namely: (1) that for 
any prenucleus (3, the associated closure operator (3 is a nucleus [1], 
and (2) that the nuclei bijectively correspond to the sublocales [20]. 

Lemma 2. The followiny conditions on a preclosure operation (3 : 
L — > L are equivalent: 

(a) (3 is a pseudonucleus. 

(b) (3 is a nucleus. 

(c) L/P = L 33 = {z e L : f3z = zj is a sublocale of L. 

In particular, the dual isomorphism 7 1 — > L 1 between the complete 
lattice of all closure operations on L and that of all [\-closed subsets 
of L induces a one-to-one correspondence between nuclei and sublocales 
ofL. 

Proof. (a)=^(c). If z = f3z G L 13 then for y = z\x, one obtains 

x Ay < z, hence x A f3y < z, i.e. (3y < z:x = y G I/ 3 . 

(c) => (a). If x A y < z = f3z then y < z:x E L 13 , i.e. (3{z : x) < z : x 
and therefore x A (3y < x A j3(z:x) < z. 

The equivalence (b) (c) is now obtained for (3 instead of (3. □ 

If L 13 contains the join-dense subset X, one can say more: 




The Polarity between Approximation and Distribution 



183 



Lemma 3. Let (3 : L — >L be any preclosure operation whose set of 
fixpoints, Y = If , contains the join-dense subset X, and let 7 be the 
corresponding closure operation (thus 7 = /3 and Y = L 1 ). Then the 
following statements are all equivalent: 

(a) /3 is a pseudonucleus. 

(b) 7 is a nucleus. 

(c) 7 induces a frame homomorphism from L onto ZA 

(d) 7 preserves finite meets. 

(e) lA is a sublocale of L. 

(f) L 13 is a frame (with respect to the induced order). 

(g) x E X and y E L imply x A jy = 7(2 A y) . 

(h) x E X, y E L and 2 < 7 y imply 2 = 7(2 A y). 

( i ) 2 E X and y E L imply x A (3y < 7(2 A y) . 

( j ) 2 E X, y E L and x < (3y imply 2 = 7(2 A y) . 

Proof. In the implication circuit 

(a) =7 (b) =7 (c) =7 (d) =7 (e) =» (f ) =7 (g) =7 (h) =7 ( i ) =7 (j) =7 (a) , 

only the following three implications remain to be verified: 

(f)= 7 (g). Joins in the complete lattice Y are given by \f Y W = 
7 (V^ 0 - Let 2 E X and y E L. The inclusion X C Y and the 
join-density of X together with the frame property of L and Y yield 

7(2 A y) = jiyix A 2' : 2' G X, x' < y}) = 

\J Y { 2 A x' : 2' € X, x' < y} = 2 A \/ Y {x' G X : x' < y} = 

x A 'yiyix' e X : x' < y}) = 2 A 7 y. 

(h)= 7 (i). For x' £ X with 2' < 2 A f3y, it follows that 2' < 72/, hence 
2 ' = 7(2' A y) < 7(2 A y). Now, join-density of X yields 2 A fdy < 
7 {x Ay). 

(j)=7(a). Analogously. □ 

Our primary goal is to find handy conditions on J under which the set 
Z 6 of all Z-d-closed elements becomes a frame. As we shall see, it is 
sufficient (but not necessary) to require that 
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for all x E X and z E Z with x < Sz, 
there is a w E Z with w < z and x = 5w, 

in which case we say Z is locally approximating-, on the other hand, 
the same condition for w E L instead turns out to be necessary (but 
not sufficient) for Z s to become a frame. Therefore, we also need 
certain intermediate properties and call Z locally f3- approximating for 
any {3 : L — > L if 

for all x E X and z E Z with x < Sz, 
there is a w E L with w < z and x = f3w. 

Apparently, that condition may be condensed to the following: 

x E X, z E Z and x < Sz imply x = (3{x A z) , 

which in turn, under the hypothesis f3 < is equivalent to 

y E L, z E Z and y < Sz imply y < fi(y A z ) , 

by join-density of X C L 5 . Thus, the local /^-approximation prop- 
erty may be defined without reference to the selected set X. Observe 
also that Z is locally ^-approximating iff it satisfies the equivalent 
conditions in Lemma 1. 

Finally, call Z locally A- approximating if 

x E X, z E Z and x < Sz imply x = S(x A z) and x A z E Z 

and note the following implications: 

locally A- approximating 
locally approximating 

4 

locally S z -approximating 

_ 4 

locally S z -approximating 
locally S -approximating 

None of these implications may be inverted, not even in the rather 
specific “standard situation” known from domain theory, where L is 
the frame of all downsets of a dcpo, S is the cut operator, X is the set 
of all principal ideals, and Z is the standard extension of all directed 
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downsets (see the introduction and Section 4, and for counterexamples, 
[13]). 

However, if Z is closed under binary meets, the five properties in the 
above implication diagram are equivalent. (Indeed, x E X C Z,w < z 
and x = 8w entail w < x A z < x = Sx, hence x = 8{x A z) and 
x A z E Z.) 

Corollary 1. Z is locally 8 -approximating iff the A -sub semilattice 
generated by Z is locally (A-) approximating. 

Therefore, it is often possible to assume that Z be closed under finite 
meets (A -closed for short), which considerably simplifies the theory 
(see, for example, Corollary 2). 

Lemma 3, applied to (3 = 8 Z and 7 = 8 Z , yields the desired necessary 
and sufficient criteria for the frame property of Z s . 

Proposition 1. The following statements are equivalent: 

(a) is a pseudonucleus. 

(c) 8 Z is a nucleus. 

(d) 8 Z induces a frame homomorphism from L onto Z s . 

(e) Z 6 is a sublocale of L. 

(f) Z s is a frame ( with the induced order). 

(e) Z is locally 8 Z -approximating. 

Each of these conditions is fulfilled if Z is locally approximating. 

Lemma 4. A map (3 : L — > L with (3 < 8 satisfies the inequality 
x A 8 z y < f3{x A y) for all x,y E L 
iff Z is locally (3- approximating. 

Proof. If the stated inequality holds then x E X, z E Z and x < 8z 
imply x = xA8z = xA 8 z z < (3{x A z) < (3x < 8x = x. 

Conversely, suppose Z is locally /3-approximating. Since X is join- 
dense in L, we get for arbitrary x,y G L: 

x A 8 z y = x A \J{8z : z € Z,z < y} = \/{x A 8z : z G Z, z < y} 

= V{*' G X : 3z € Z (x ! < x A 8z, z < y)} 
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< \/{x' el : 3z e Z (x 1 = fl(x' A z ) < x, z < y)} 

< /3(x A y). □ 

The special case /5 = 8 Z amounts to 

Proposition 2. 8 Z is a prenucleus iff Z is locally S z -approximating. 

In case X is the least join-dense subset, consisting of all \/-irreducible 
elements (a situation to be discussed more thoroughly in Section 4), 
we have another convenient coincidence: 

Lemma 5. If X consists of \/ -irreducible elements then every locally 
5 Z -approximating subset Z is already locally approximating. 

This holds because for V-irreducible x € X C Z, 

x = S z w = \/{5v : veZ,v<w} implies 
x = 8v for some v^Z with v<w. 

As a consequence of the previous facts, we note: 

Proposition 3. 8 Z is a nucleus (and Z 5 is the associated sublocale) 
iff Z is locally 8 Z -approximating and 8 Z [L\ is contained in Z 6 . If X 
consists of\J -irreducible elements then these two conditions are also 
equivalent to the local approximation property for Z combined with 
idempotency of 8 Z . 

Corollary 2. If Z is A-closed then the following conditions are equiv- 
alent: 

(a) Z 6 is a frame (and a sublocale of L). 

(b) 8 Z is a nucleus. 

(c) 8 Z is a prenucleus. 

(d) 8 induces a A-homomorphism from Z into L s . 

(e) Z is locally (A-) approximating. 

An example in [13] shows that if Z fails to be A-closed, it may be 
locally approximating but not locally A-approximating. 

Recall that Z is Galois-closed iff 8z = 8 z z implies z E Z (and that 
the converse implication is always true). For Galois-closed sets, the 
situation is simpler, on account of 
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Lemma 6. Every Galois-closed locally S z -approximating subset Z is 
A-closed, hence locally A- approximating. 

Proof. For y,z £ Z, we compute, using Proposition 1 : 

5(y A z) < 5y A Sz = S z y A 8 z z = S z (y A z ) , 

Hence, y A z is a member of the Galois-closed set Z. Clearly, the top 
element of L belongs to any Galois-closed subset. □ 

We now show that every sublocale containing L 5 arises in the form 
Z s . Simultaneously, we generalize the fact that L s is a sublocale iff L 
is locally approximating (see Lemma 3). 

Proposition 4. A subset Y of L is a sublocale containing L s iff Z = 
Y Q is locally (A-) approximating and satisfies Z s = Y. 

Proof. Z = Y e = {z £ L : Sz — ^yz} is Galois closed. Suppose Y 
is a sublocale with L s C Y and consider elements x £ X, z £ Z with 
x < Sz. As 5z = 7 yz, Lemma 3 yields 

x = 7 y(x A z) < S(x A z) < Sx = x, 

and so x = 'yy(xAz) = S(xAz), whence xAz£Y 8 = Z. This already 
shows that Z is locally A-approximating. The inclusion Y C Y eS = Z 5 
is clear. For the converse inclusion, consider y £ Z 5 and x £ X with 
x < 7 vy. Setting z = x Ay, we obtain (using Lemma 3 as before) 
x = 7 yz = Sz, whence z £ Y 8 = Z, and z < y yields x < S z y = y (by 
the choice of y £ Z 6 ). Again by join-density of X, we conclude that 
ivy < y e Y. 

On the other hand, Proposition 1 tells us that for all locally approx- 
imating subsets Z, the set Y = Z 5 is a sublocale (with L s C Y). 

□ 

Corollary 3 . Z is Galois closed and locally (A-) approximating iffY — 
Z 6 is a sublocale of L with Z = Y 8 . 

Let us call an element z £ L 6 -nuclear if for all x £ X, x < Sz 
implies x = S(xAz). Alternatively, we may describe 5-nuclear elements 
without reference to the join-dense subset X, by postulating that y < 
Sz implies y < S(y A z), or even shorter, that y A Sz < S(y A z) for 
all y £ L. Clearly, each fixpoint of S is 5-nuclear, and by Lemma 3, S 
is a nucleus iff each element of L is 5-nuclear. Furthermore, we note 
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that a subset is locally ^-approximating just iff it consists of (j-nuclear 
elements, and conclude: 

Lemma 7 . A Galois-closed subset of L is locally approximating iff 
it consists of 5 -nuclear elements. Hence, the Galois-closed locally ap- 
proximating subsets of L form a closure system whose greatest member 
consists of all 5 -nuclear elements. 

Proof. Only the last claim requires a verification. The set N of all 
5-nuclear elements is A-closed by Lemma 1. It is a sublocale by Propo- 
sition 1, and by Proposition 4, its closure N 6g is a Galois-closed locally 
approximating subset containing N. By maximality, N must coincide 
with its Galois closure. □ 

Summarizing the previous results, we arrive at 

Theorem 1. Let L be a frame with a specified closure operation 5 
whose range L 5 is join-dense in L. Then the sublocales containing L s 
are precisely the sets of all Z-S-closed elements for locally approximat- 
ing sets Z. 

Furthermore, the polarity associated with S induces a dual isomor- 
phism between the closure system of all sublocales of L containing L s 
and the complete lattice of all Galois-closed locally approximating sub- 
sets of L. 

Corollary 4. The sublocales of L containing L 6 are precisely the sets 
Z 8 where Z runs through all A - subsemilattices of L consisting of S- 
nuclear elements. Moreover, the 6 -nuclear elements form the largest 
sublocale Y of L so that 6 restricted to Y becomes a nucleus. 

2 Completely Distributive Lattices and 
Approximating Subsets 

We now turn to the “global” analogues of the previous results, with 
the same hypotheses as at the beginning of Section 1, but this time 
focussing on maps and distributive laws that involve infinite meets. 
Although, at first glance, one might conjecture that the main steps in 
the local theory developed in Section 1 have their strict analogues for 
the global situation (and this will be justified in a few instances), there 
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are some situations where the global analogues fail at least partially 
or are more complicated. For example, Lemma 3 has no exact coun- 
terpart for maps that preserve arbitrary meets. To be more explicit: it 
is not true in general that if a prenucleus (3 preserves arbitrary meets, 
the associated nucleus (3 would also preserve arbitrary meets. 

Example 2. Consider the complete chain 
L = {+n :new + l= wU {u;}} 

and the prenucleus (3 with (3oj = to, (3 {—oj) = — uj and (3n = n + 1 
otherwise. Apparently, (3 preserves arbitrary meets. But this no longer 
holds for the nucleus 7 = (3, because y(— to) = —to, whereas 7 x = co for 
all other x € L. In the complete lattice of all nuclei on L, we observe 
that 7 is the join of the nuclei 7 „ with 7 n n — n + 1 and j n x = x 
otherwise, where n runs through the integers, and each j n preserves 
arbitrary meets! 

As in the situation described in the previous section, we consider a 
join-dense subset X of a complete lattice L and a subset Z of L con- 
taining X. Inspired by the theory of ^-continuous posets and their 
Z-below ideals (cf. [4, 12, 22, 25, 26] and Section 4), we define the 
Z-core of y E L (with respect to the given closure operation S) by 

$zy = /\{z € Z :y < Sz}, 

and the Z-below part of y by 

QzV = \J{5zX : x E X,x<y}. 

Recall that x < y means that the element x belongs to each downset 
whose join dominates y, and note that Szx coincides with g z x when- 
ever a; is a \/-prime element of L (i. e. x<x £ X). The least /\-closed 
subset containing Z is given by 

AZ = {f\W : W C Z}. 

From the definition of Sz, one concludes that Sz = S\z and that 
$zy < SzSy < y for all y e AZ. If 5 is the identity map then Sz is the 
closure operation 7 ^ associated with Z. Also, by definition, we have 
QzV — &zy and g z y < y for all y E L. Moreover, since x <\JW is 
equivalent to x <1 w for some w G W, we see that g z preserves joins. 
In all, we have: 
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Lemma 8. g z is a dual preclosure operation that preserves arbitrary 
joins. It has therefore a right adjoint that is a preclosure operation 
preserving arbitrary meets, in particular a nucleus. 

In Proposition 6, we shall find necessary and sufficient conditions un- 
der which the right adjoint of g z is just 5 Z . 

Let us begin our investigation with the global counterpart of Lemma 

1 . 

Lemma 9. The following statements are equivalent: 

(a) y < 85 z y for each y e L. 

(b) x — 58 z x for each x e X . 

(c) y A /\ 8\W] < 8{y A f\ W) for all y G L and W C Z. 

(d) 8 preserves arbitrary meets of elements from Z. 

(e) 5 induces a f \-homomorphism between A Z and L 5 . 

(f) The restriction 8 : A Z — > L s has a left adjoint (5 Z ). 

Proof. The implication circle (a) =>• (f) =4> (e) =>- (c) (d) =>■ (b) =4> (a) is 

easily established: for (a)=^(f), recall that 5 z 8y < y for y £ A Z. For 
(b)=>(a), use join-density of X, and for (e)=^(c), apply the corre- 
sponding implication in Lemma 1 to z\ = f\ W (n = 1) and A Z instead 
of Z, using the equation 8z\ = f\8\W\. □ 

Next, let us note that the following two conditions on a subset X of a 
complete lattice L are equivalent: 

• L is completely distributive and X is join-dense in L 

• X is a strong base, i.e. y = \J{xEX : x < y} for all y£L. 
Henceforth, we shall assume that 

L is a completely distributive lattice with a strong base X C Z. 

Consequently, the relation < is interpolating. As for frames, there 
is a nice characterization of complete distributivity in terms of Galois 
connections: a complete lattice L is completely distributive iff the join 
map V from the downset lattice onto L has a left adjoint (viz. the 
“well-below” map y i — X y). Clearly, in every complete lattice, the 
join map has a right adjoint, the principal ideal map y i — > fy = {x £ 
L: x <y}. 
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As mentioned earlier, Lemma 3 can be transferred to the global level 
only in part. 

Lemma 10. For a closure operation 7 on L with join-dense range 
Y = LP , the following statements are equivalent: 

(a) 7~ 1 [^2/] is a principal ideal for each y G L. 

(b) 7 has a left adjoint. 

(c) 7 induces a complete homomorphism onto Y . 

(d) 7 preserves arbitrary meets. 

(e) Y is a completely distributive sublocale of L. 

(f) Y is a completely distributive lattice. 

Hence , a join-dense subset of L is a completely distributive complete 
lattice ijf it is a sublocale whose nucleus preserves arbitrary meets. 

Proof. The equivalences (a) •<=>• (b) (c) (d) are well-known (cf. [15]). 
For (c)=^(e), apply Lemma 3 and the fact that complete homomor- 
phisms preserve complete distributivity. Clearly, (e) implies (f). 
(f)=^(d). Let W C L and put Y w = {y G Y : y < w} for w G W. 
Then each Y w is a downset in Y. By join-density of Y, we get 7 w = 
7 (VYw) = Vy^w, and by complete distributivity, 

A(7 w : w e W} = A y {V y Y w : w e W} = 

VrfKn. :weW} = y r {yeY : y < /\ W} = 

7(V{sel': V <A»’}) = 7(A»')-n 

Without the density assumption, condition (f) is strictly weaker than 
the other statements (a) - (e). 

Now, we formulate the global counterparts of the local approximation 
properties introduced in Section 1. Given a map f3 : L — >L with 
P < 8, we say Z is (globally or uniformly) f3 -approximating if 

x = P8zx for all x £ X, 

(globally or uniformly) approximating if 

x = 88zx and 5zx 6 Z for all x G X, 

and strongly ((3-) approximating if, in addition, 8 Z is idempotent (hence 
a closure operation). Note that corresponding local and global approx- 
imation properties differ only by the position of quantifiers: 
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Z is locally approximating iff 

Vx€XVz£Z3w£Z(x = Sw and (x<8z w<z)), 

Z is (globally) approximating iff 

VxEX3wE.ZVz€.Z(x = Sw and ( x<Sz =>- w<z )), 

and similarly for the /3-approximation properties. In particular, every 
(/?-) approximating set is locally (/?-) approximating. Notice that in 
the second description of approximating sets, w must be the Z-core 
of x. But even a strongly approximating set need not be locally A- 
approximating (the condition xAz € Z for z E Z may fail; see Example 
4 at the end). However, Z is approximating iff so is A Z, and clearly 
x A z is in A Z for x £ X and z € A Z. Hence, in many instances of the 
global approximation theory, there is no essential loss of generality in 
assuming that Z be /\-closed. 

By definition, Z is ^-approximating iff it satisfies the equivalent con- 
ditions in Lemma 9. In particular, the whole lattice L is (strongly) 
approximating iff 5 preserves arbitrary meets, i.e. L s is completely 
distributive (cf. Lemma 10.) 

Similarly as in the local case, we have the following implications be- 
tween the various global approximation properties: 



strongly approximating 
(globally) approximating 

4 

S z -approximating 

_ ^ 

S z -approximating 

$ 

S -approximating 

If Z is /\-closed then 5 z x belongs to Z for all x e X, so that the last 
four approximation properties are equivalent. From Lemma 9 and the 
equation Sz = Saz, we immediately infer: 

Corollary 5. Z is /3 -approximating iff so is the f\-closed subset A Z 
generated by Z. In particular, the following three statements are equiv- 
alent: 
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(a) S preserves arbitrary meets of elements from Z. 

(b) Z is 5 -approximating. 

(c) A Z is (5-) approximating. 

Lemma 9, applied to the closure operation 6 Z , amounts to 

Proposition 5. Z 6 is completely distributive (and a sublocale of L) 
iff S z has a left adjoint, that is, 5 Z preserves arbitrary meets (and 
induces therefore a complete homomorphism onto Z s ). Each of these 
conditions implies that Z is 5 Z -approximating. 

Likewise, the global counterpart of Proposition 2 involves certain ad- 
joint maps. 

Proposition 6. The following conditions are equivalent: 

(a) q z is left adjoint to S z . 

(b) 6 Z preserves arbitrary meets. 

(c) Z is 5 Z -approximating. 

Proof. (a)=^(b). Right adjoints preserve meets. 

(b) =^(c). By the inclusion X C L s , we get for x E X: 

S z 5zx < Sx = x < f\{Sz : z e Z,x< 8z} = 
f\{5 z z : z e Z,x< 8z} = 5 z 5zx. 

(c) =^(a). Suppose y < 5 z w = \J{8z : z G Z, z < w}. For x G X 
with x < y, we find a z E Z such that z < w and x < 8z. Hence 
Szx < z < w, and g z y = \/{8zx : x G X, x < y} < w. (For this part, 
the hypothesis (c) is not needed.) 

Conversely, assume g z y < w, i. e. 8zx < w for all x 6 X with 
x < y. Any such x satisfies x = 8 z Szx < 8 z w. By the strong base 
property of X, it follows that y = \/{x £ X : x < y} < 5 z w. In 
all, we have established the required adjunction equivalence g z y < 
w <=? y < 8 z w. □ 

In analogy to Lemma 5, we have 

Lemma 11. If X consists of \J -irreducible elements then every S z - 
approximating subset is already approximating. 




194 



M. Erne 



Proof. If x is \/-irreducible then x = S z 5 zx implies x = 8z for some 
z e Z with z < Szx. It follows that 5zx = z G Z and x = 88 zx. □ 

The conditions in Proposition 6 imply that Z is (^-approximating, 
but what about the converse? We have not settled that problem in 
general, but results in [12] show that under reasonable circumstances 
(in domain theory, say), the properties of 8 Z - and (^-approximation 
are equivalent. 

Combining the previous results with the fact that one partner of an 
adjunction is idempotent iff so is the other, we obtain: 

Proposition 7. The following statements are equivalent: 

(a) Z s is the range of 8 Z and completely distributive. 

(b) 8 Z is a closure operation preserving arbitrary meets. 

(c) 8 Z induces a complete homomorphism from L onto Z 5 . 

(d) g z is idempotent and left adjoint to 8 Z . 

(e) Z is strongly 8 Z -approximating. 

Each of these conditions is fulfilled when 

(f) Z is strongly approximating, 

and if X consists of \f -irreducible elements, (f) is also a consequence 
of the other conditions. 

From Corollary 5 and Propositions 6 and 7, we infer: 

Corollary 6. Let Z be a /\- closed subset of L containing L 5 , and 
consider the following conditions: 

(a) Z s is completely distributive. 

(b) Z s is a sublocale whose nucleus preserves arbitrary meets. 

(c) 5 Z induces a complete homomorphism from L onto Z s . 

(d) 8 Z preserves arbitrary meets. 

(e) 8 induces a f\-homomorphism from Z onto L 6 . 

(f) Z is approximating. 

In general, (a) (b) •O’ (c) =>• (d) 4=> (e) 4=> (f ) . If 8 Z is idempotent, all 

six conditions are equivalent to the strong approximation property of 

Z. 
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Thus, for /\-closed Z, the situation becomes considerably simpler, but 
remains still more complicated than in Corollary 2. Example 2.7 in [5] 
shows that even for /\-closed approximating subsets Z, the /\-closed 
subsets Z 6 need not be completely distributive, although we know 
from Proposition 3 that Z 5 must be a sublocale, hence a frame in its 
own right. Another example in [5] shows that complete distributivity 
of Z 5 is not sufficient for idempotency of 8 Z . 

The global counterpart of Lemma 6 is more involved, too. What we 
need is a handy criterion ensuring idempotency of 5 Z or g z , respec- 
tively. The following definition will be helpful: Z is (g z ~) invariant if 
Qz[Z\ U Zi . 

Lemma 12. (1) If Z is 7 - approximating for some closure operation 
7 on L with 7 <8 then 7 = 7 g z - 

(2) Every invariant 8 -approximating subset Z of L is strongly approx- 
imating. 

(3) Every Galois-closed (8 Z -) approximating subset Z of L is f\-closed, 
invariant and strongly approximating. 

Proof. (1) 7 y = 7(V( a; £ X : x < y }) = 

7(V(7^ : x E X,x < y}) = 

'r(y{Szx : x E X,x< y}) = 7 g z y. 

(2) x E X C Z implies z := PzQz x £ Z, by invariance of Z. Using 
(1) for 8 instead of 7, we get Sz — 8 g z x — 8x — x. The definition of 
8 Z yields 8 z x < z < g z 8 z x < 8 z x, and it follows that 8 z x = g z 8 z x = 
z E Z and 88 z x = x. Thus, Z is approximating. Furthermore, 

QzV = yiSz^zx -x<y} < g z g z y < g z y , 
establishing idempotency of g z and so, by Proposition 7, of 8 Z . 

(3) Put 7 = 8 Z . Galois-closedness of Z means that z E Z is equivalent 
to 7 z = 8z. If Z is 7-approximating then, by (1), 

1 g z z — 8 Q z z <8z = jz = ' yg z z , 

whence g z z E Z. Thus, Z is invariant and (2) applies. We con- 
clude that 7 coincides with 8 Z and preserves arbitrary meets (again 
by Proposition 7). In particular, for W C Z, we obtain 

A7[W'] = 7(A«0 < 6(AW) < AW = AW 
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and therefore f\W E Z. □ 

After these remarks about invariant and Galois-closed subsets, we are 
now in a position to establish the global version of Proposition 4. 

Proposition 8. A subset Y of L contains L 5 and is completely dis- 
tributive (hence a sublocale) iff Z = Y e is strongly approximating and 
satisfies Y = Z 6 . In particular, L 5 is completely distributive iff L is 
strongly approximating. 

Proof. If Y is of the form Z s for a strongly approximating set Z 
then, by Proposition 7, Y is a completely distributive sublocale with 
Y D L s . 

Now assume that, conversely, Y contains L 6 and is completely dis- 
tributive, hence a sublocale with nucleus 7 . We have already re- 
marked that Z = Y e is Galois closed. For x E X C Y n Z, we 
get 'jSzx < 85zx < Sx = x and 

x < A {*z:zeZ,x < Sz} = f\{^z : z E Z,x < Sz} = 7 S z x, 

because 7 preserves meets (Lemma 10). Hence, each x E X satisfies 
7 Szx = 5Szx = x and 8zx E Z. This together with Lemma 12(3) 
shows that Z is strongly approximating. The equation Y = Z 5 follows 
from Lemma 10 and Proposition 4. □ 

Corollary 7. Z is Galois closed and strongly approximating iff Y — 
Z 5 is a completely distributive sublocale of L with Z = Y e . 

There is no exact counterpart of Lemma 7 for approximating sets, be- 
cause neither the completely distributive sublocales nor the approxi- 
mating Galois-closed subsets of L always do form a complete lattice. 
Still worse, Example 3 will demonstrate that in general, there is no 
least completely distributive sublocale containing L s and no greatest 
approximating Galois-closed subset of L. 

Summarizing the facts established in Propositions 7, 8 and Corollary 
7, we may now formulate the global version of Theorem 1. 

Theorem 2. Let L be a completely distributive lattice with join-dense 
subset X, and let 5 be a closure operation on L with X C L s . Then 
a subset Y of L contains L 5 and is a (f\-closed) completely distribu- 
tive lattice iff there exists a strongly approximating subset Z of L with 
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Y = Z s . Moreover, the assignment Z i — > Z s yields a dual order iso- 
morphism between the system of all (strongly) approximating Galois- 
closed subsets of L and the system of all completely distributive fa- 
closed subsets (sublocales) of L containing L 5 . 

We conclude this section with the remark that many of the previous 
results and their local counterparts admit a common generalization 
to the situation of meets formed by fewer than k elements, where k 
is a regular infinite cardinal. For example, call a subset Z of L k- 
approximating if for each x G X and each subset W of Z with fewer 
than k elements, x < /\5[VF] implies x = 8z for some z e Z with 
z < fa W, and call a complete lattice k- distributive if V D 3^ = A VtXI 
for all systems y of less than k downsets. Then, among other facts 
generalizing those obtained before, one may show that if a /\-closed 
subset Y of a ^-distributive complete lattice with closure operation 8 > 
7 y is ^-distributive, too, then the set Z = Y e is ^-approximating and 
satisfies Y = Z s ; and the converse holds at least if 8 Z is idempotent. 



3 The Z-below Relation and the Interpolation 
Property 

As before, L denotes a completely distributive lattice with strong base 
X, and Z a subset of L containing X. Following a central idea in the 
theory of ^-continuous posets (see the introduction and Section 4), 
one is tempted to introduce a Z-below relation on L by x<zU <=> x < 
5 z y or by x <\ z y ^ x < g z y , but neither of these attempts does 
lead to the desired results. It turns out that the “right” definition of 
the Z-below relation is the following: 

x< zV ^ x< QzU 

which, by the approximation and interpolation properties of the rela- 
tion < is tantamount to 

x<\ z y O x< $z x ' < x' < y for some x' E X . 

On the other hand, <z determines g z by the equation 

QzV = Vi u € L ■ u< z y}- 

The join-preservation property of g z together with the equivalence 
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x <\JW x < w for some w £ W 
entails the rule 

x < z\/W x < zw for some w £ W. 

By an auxiliary relation (cf. [18]) on a poset P — ( X , <), we mean a 
relation -< such that 

u < v -< w < y =>- u -< y =>• u < y. 

In particular, any auxiliary relation is transitive (but we do not re- 
quire, as in [18], that the sets {a; : x -< y} be directed). Thus, an 
auxiliary relation is idempotent iff it has the interpolation property. 
It is easy to see that the relations < z are always auxiliary and, in 
particular, transitive. As we have seen in Section 3, it is important 
to have good information about conditions under which the maps 6 Z 
and g z are idempotent. 

Lemma 13. The relation < z is idempotent (interpolating) iff the map 
q z is idempotent (hence a dual closure operation). Sufficient for idem- 
potency is the equation 

5 z x = g z 5 z x for all x £ X, 

and this is also necessary if the elements of X are \J -prime. 

Proof. Recall that g z y = \/{u : u<zy} (y £ L). Hence, if <z is 
idempotent then 

QzQzV = V{“ ■■ u< z \/{v : v< z y}} = 

\J{u : u <\ z v < z y for some v} — \J{u : u < z y} = g z y . 

Conversely, suppose g z is idempotent. Then, as g z preserves joins, 
u< z y u< g z y = \/{g z v : v < g z y) 

u < g z v for some v < g z y u <a < zy for some v. 

By definition of g z , the equation 5 z x = g z ^zx for all x £ X entails 
QzV = V{Qz s zx : x £ X,x< y} < g z g z y < g z y, 

hence idempotency of g z . On the other hand, if each x £ X is \J- 
prime, i.e. x < x, then S z x = g z x, and consequently, idempotency of 
g z entails 5 z x = g z S z x. □ 
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Invoking Proposition 7 once more, we obtain the remarkable coinci- 
dence of three idempotency conditions. 

Corollary 8. Suppose Z is approximating. If one of the relations, 
respectively, functions <z , Qz and, 6 Z is idempotent then so are the 
others. 

This together with Lemma 12 (2) confirms the introductory remark 
that often “interpolation follows from approximation” . 

We prove two technical lemmas that simplify arguments in order to 
ensure idempotency. 

Lemma 14. Let n be a positive integer and 7 a closure operation on 
L with 7 < 5. Then the following statements are equivalent: 

( a ) x = 7 5zx for all x £ X ; that is, Z is 7- approximating. 

(b„) x = 'yg z n 5zx for all x £ X. 

(c n ) x = r yg z n x for all x £ X. 

Proof. (a)=^(b„). Put x n = g z n d z x. Thenx = jxq , and the equations 
jx n +i = 7 Qz x n = 7(V{^' : x 1 £ X,x’ < x n }) = 

7(V{7<^' : x' £ X, x' < x n }) = 
l{\J{x' £ X : x' < x n }) = jx n 
show (via induction) that x = jx n for all n £ u>. 

(b n ) =>• (c n ) . Since all occurring maps are isotone and g z is contractive, 
we obtain x = 'ypzfdzx < - ig z x <7 x < Sx = x. 

(c n ) =>-(a). x = 7 g z n x < 7 g z x < 7 8 z x <7 x < Sx = x. n 

Lemma 15. Suppose Z is S -approximating. Then, for any integer 
n > 1 and any element x £ X, the following are equivalent: 

(a) g z S z x £^Z. (a 7 ) S z x = g z S z x £Z. 

(b n ) Qz'Szx £ Z. (b(j) S z x = g z 8 z x £ Z. 

(c n) g z n+lx e Z. «) S z x = q z n+1 x £ Z. 

Proof. (a)=>(a'). For z := g z &z x £ Z, we have x = Sz by Lemma 14 . 

Hence, S z x < z and z < S z x. 

(b n )=^(b(J and (c n )=>(c' n ) are shown analogously (using Lemma 14 ). 
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(a')=^(c(J. By induction, 8 z x = g z 8 z x yields 
8 z x = Q z n+1 $zx < g z n+1 x < g z 2 x < g z 8 z x = 5 z x. 

( C n)=K b n)- S zX = QzQz x < Qz?>zX < S Z X. 

(b(J=^(a'). S z x = g z 8 z x < g z 8 z x < 6 z x. □ 

Applying the last lemma to the case n = 1 and invoking Lemma 13 
leads to 

Corollary 9. Z is 5 -approximating and satisfies g z Qz £ Z f or 
x EX iff Z is approximating and g z &zx = S z x for all x e X. These 
two conditions are equivalent to the strong approximation property if 
X consists of \/ -primes. 

Now, combining the results in Section 3 with those in Section 2, we 
finally arrive at 

Theorem 3. Consider the subsequent conditions: 

(al) Z is S -approximating and invariant. 

(a2) Z is 5 -approximating and g z g z x £ Z for all x e X . 

(bl) Z is approximating with interpolating Z-below relation. 

(b2) Z is approximating and g z g z x = Qz x f or x e X . 

(cl) Z is strongly approximating. 

(c2) Z is approximating. 

(dl) Z is strongly 8 Z -approximating. 

(d2) Z is S z -approximating. 

(el) Z 5 is a completely distributive sublocale with nucleus 8 Z . 

(e2) 8 Z preserves arbitrary meets. 

(fl) S z preserves arbitrary meets. 

(f2) Z is S z -approximating. 

(gl) S z has a left adjoint. 

(g2) 5 preserves meets of subsets of Z . 

(hi) Z s is completely distributive (and a sublocale). 

(h2) Z is 8 -approximating. 
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In general, one has the following diagram of implications: 

(al) =>- (bl) 77 (cl) =7- (dl) 77 (el) =7> (fl) 77 (gl) ^ (hi) 

(a2) 77 (b2) =7- (c2) =7- (d2) 77 (e2) =7 (f2) =7- (g2) 77 (h2) 

If each x £ X is \f -prime then the conditions (a2), (bl), (b2), (cl), 

(dl) and (el) are all equivalent, and so are (c2), (d2) and (e2). 

If Z is f\-closed, (c2), (d2), (e2), (f2), (g2) and (h2) are equivalent. 

If Z is invariant, all 16 conditions are equivalent. 

4 Standard Extensions and Completions 

Many effective applications of the previous results are obtained in the 
following “standard situation”, alluded to already in earlier sections. 
Let P = ( X , <) be an arbitrary poset, £ the completely distribu- 
tive frame of all downsets, A the cut operator, and X the set of all 
principal ideals. Hence, X is contained in the range of A, called the 
MacNeille completion, normal completion or completion by cuts of P 
and denoted by M. The condition X C Z C £ means that Z is a 
standard extension. By definition, Z A is the system of all Z-A-ideals 
(i.e. downsets Y such that A Z C Y for all Z G Z with Z CY; see the 
introduction and [10, 11, 12]). Thus, Z A is always a standard comple- 
tion, that is, a closure system of downsets that contains all principal 
ideals (cf. [16]). Note that every T 0 closure system is the standard 
completion of a unique poset. (Take the specialization order given by 
x < y iff x belongs to the closure of y .) The largest standard comple- 
tion is the Alexandroff completion £, often also denoted by A. It is 
not only completely distributive but even superalgebraic: its \/-prime 
(= supercompact) elements are the principal ideals and form the least 
join-dense subset X. 

Conversely, if L is a complete lattice whose \/-prime elements form a 
join-dense subset X then this is the smallest one, and L is isomorphic 
to the Alexandroff completion £ of the subposet P = (A, <), where 
< is the order induced from L. In fact, the join map \j : £ — > L is an 
isomorphism sending X to X and Af to AX. The closure operation 
7x associated with X is, up to isomorphism, the cut operator A of P, 
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via the equation 7 *oV = V°A. 

A standard extension Z is locally A-approximating iff for all Z £ 
Z and x £ X fl A Z there is some W C Z with x = \J W, locally 
approximating iff, in addition, W may be chosen from Z, and locally 
A-approximating iff, moreover, one may take W = ^x fl Z. Similarly, 
Z is A-approximating iff for all x £ X there is a W such that x = \J W 
and W C Z for all Z £ Z with x £ A Z, and approximating iff, in 
addition, W may be taken from Z. In the latter case, W must be the 
(modified) U-below ideal 

if z x = f \{Z eZ-.xe A z} 

which is the Z- core and also the Z-below part of the principal ideal 
generated by x. Recall that a strongly approximating standard exten- 
sion is one such that A 2 is idempotent, i.e., the sets 

A Z Y = (J{AZ : ZeZ, ZC±Y) 

are always iGA-ideals. Thus, a standard extension Z is (strongly) 
approximating iff the underlying poset is (strongly) Z-precontinuous in 
the sense of [12]. Also, the ,2-below relation between downsets extends 
the original U-below relation between elements as defined in [12]. The 
crucial role of the Z-below maps in the theory of ^-continuous posets 
was already pointed out in [5]. 

Since X consists of (all) \/-prime elements of £, every locally A^-ap- 
proximating standard extension Z is already locally approximating, 
and every A^-approximating standard extension Z is already approx- 
imating. This observation makes some of the “standard” applications 
easier to handle than the general case discussed in Sections 1-3. 
Propositions 3 and 7 have the following consequence for standard ex- 
tensions: 

Proposition 9. Let Z be a standard extension with idempotent op- 
erator A z . Then Z is locally or strongly approximating iff Z A , the 
closure system of Z-A-ideals, is a frame or completely distributive , 
respectively. In both cases, Z A is a sublocale of the Alexandroff com- 
pletion and A 2 is the corresponding nucleus. 

A downset Y belongs to the Galois closure of Z iff the Z-A-ideal 
generated by it is already a cut. Hence, Z is Galois closed iff A Z Y = 
AT implies T £ Z. The Polarity Theorems 1 and 2 amount to the 
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following “standard” version (cf. [14] for the first part and [12] for the 
second): 

Theorem 4. Associating with each locally approximating standard ex- 
tension Z of aposetP the collection Z A of all Z-A-ideals, one obtains 
precisely all standard completions that are frames. Similarly, the com- 
pletely distributive standard completions of P are precisely the systems 
of Z-A-ideals coming from strongly approximating standard extensions 
Z, In both cases, restriction to Galois-closed extensions makes the re- 
spective correspondences bijective. 

A slight modification of the A-ideal construction is often more adapted 
to concrete applications, because it works with joins instead of cuts 
but does not require any completeness properties of the involved poset. 
In the most general situation of a closure operation 8 on a frame L, the 
trick is to restrict the associated polarity relation g to the cartesian 
product L x J where J = (j _1 [A]. Let us speak of a J-closed set Z if 
it is contained in J and for all y E J , it follows from 8 z y E X that 
y E Z. It is easy to check that a subset has that property iff it is 
Galois closed with respect to the restricted relation gD(L x J) (i. e. 
Z Se fl J — Z), and that Proposition 4 and Theorem 1 remain valid for 
such J-closed sets. More precisely: 

Proposition 10. Let X be a join-dense subset of a locale L with 
X C J = 5 _1 [X], and let Y,Z be arbitrary subsets of L. Then Y is a 
sublocale of L with L 6 CY and Z = Y e D J iff Z is a J-closed, locally 
(A-) approximating subset with Y = Z s . 

Specifically, for a given poset P , we denote by J the standard ex- 
tension consisting of all downsets possessing a join (so that in fact 
J = A -1 [A] for the system X of principal ideals). Then, by defini- 
tion, a standard extension Z C J is (strongly) approximating iff the 
underlying poset is (strongly) ^-continuous in the sense of [4], [22] 
and [25], and Z is jT-closed iff each member of Z has a join and Z 
contains all lower sets Y for which the Z- join ideal generated by Y 
is principal. Thus, we obtain for the closure system Z v of all Z-join 
ideals the following variant of Theorem 4: 

Corollary 10. For every poset there is a dual isomorphism, sending 
Z to Z v , between the complete lattice of all J-closed, locally approx- 
imating standard extensions and the complete lattice of all standard 
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completions that are frames. Under this isomorphism, the J -closed, 
strongly approximating standard extensions correspond to the com- 
pletely distributive standard completions. The inverse isomorphism 
maps standard completions to the collection of all downsets whose clo- 
sure is a principal ideal. 

Corollary 11. The completely distributive T 0 closure systems are pre- 
cisely the Z-join ideal systems of strongly Z-continuous posets. 

The definitions of Z-cores and Z-below parts have manifold realiza- 
tions in algebra, topology, order theory and computer science. Let us 
mention a few examples. 

(1) Consider an element r in a Z -inductive poset, as introduced by 
Wright, Wagner and Thatcher [26] (see also [11]). The downset gen- 
erated by the core ozx (consisting of all Z-compact elements below 
x ) is just our Z-core of the corresponding principal ideal (and gener- 
alizes, therefore, the aforementioned construction of Z-below ideals). 
Z-inductivity requires that Ozx £ Z and x = \J ozx. Hence, for ev- 
ery Z-inductive poset, the standard extension Z A of all Z-downsets 
is approximating. 

(2) In [6], the (neighborhood) core of a point x in a topological space 
(X, T) was defined as the intersection of all neighborhoods containing 
x. This is nothing but the T- core of {x} in the power set VX if we take 
as closure operator the identity map on VX. (If we take the closure 
operator of the space instead, we get the intersection of all closed 
neighborhoods.) It was observed in [6] that complete distributivity of 
the topology T is equivalent to local supercompactness, requiring that 
each point has a neighborhood base consisting of cores, and to some 
other conditions generalized in Lemma 10 of the present paper. 

(3) Even in the seemingly rather special case Z = L, where S^y = 

A : y < Sz}, there exist interesting applications. Consider a 

closure system y on a set X and let T be the corresponding closure 
operator. Then C = {(J W : W C X} is the Alexandroff topology of 
all downsets with respect to the associated specialization quasiorder < 
given by x < y 4=> x £ r{y}. The £-core of Y £ C is easily computed 
by the formula 

(F = f|{Z££:T CTZ} 
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where the interior relation ( is defined by 

x(y y G int(t^) <=>■ y 0 T{,2 G X : x ^ z}. 

It was shown in [16] that assigning to each closure operator its in- 
terior relation, one obtains a one-to-one correspondence between all 
closure operators with a given specialization order < that preserve 
intersections of downsets, and the approximating, interpolating, aux- 
iliary relations for <; moreover, that the corresponding lattices of 
open (respectively, closed) sets are precisely the completely distribu- 
tive ones (where A is closed iff A = IM and U is open, i.e. X \ U is 
closed, iff U = U (). Using those results from [16], we obtain: 

Theorem 5. For any poset P = ( X , <), the following objects form 
four mutually isomorphic posets: 

(1) The completely distributive standard completions of P. 

(2) The closure operators having specialization order < and preserv- 
ing arbitrary meets of downsets. 

(3) The approximating and interpolating auxiliary relations on P. 

(4) The approximating Galois-closed standard extensions, ordered by 
dual inclusion. 

It may happen that these posets are neither isomorphic nor dually 
isomorphic to the corresponding posets of standard completions etc. 
for the dual of P. For example, the chain u> of natural numbers has 
exactly two completely distributive standard completions, whereas its 
dual has only one. The example below shows that descending chains in 
the above four isomorphic systems need not possess any lower bounds 
(so that completeness is violated). 

Example 3. The “deleted real unit square” 

Q = ]0, l] 2 U {(0, 0)} = {(aq, x 2 ) : 0 < aq < 1 or aq = x 2 = 0}, 

ordered componentwise by the usual <, is a frame but not a dual 
frame, hence not completely distributive. The relations ( n on Q de- 
fined by 

(x 1 ,x 2 )( n {yi,y 2 ) xi <yi, x 2 <y 2 and xix 2 < £ 

are easily seen to be auxiliary, approximating and idempotent, but 
their intersection is {(0,0)} x Q , which certainly cannot contain any 
approximating relation (though being auxiliary and idempotent). 
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Now, let us have a look at a few specific standard extensions (see 
[9, 10, 11, 12] for more background). 

(1) The smallest standard extension X, consisting of all principal ide- 
als (A-below ideals), is always strongly approximating and opposite to 
the largest standard completion £ = X A , which is always completely 
distributive. 

(2) On the other hand, for the largest standard completion £ and the 
smallest standard completion Af = £ A , we have (cf. [8]): 

Corollary 12. The Alexandroff completion £ of a poset is locally 
(A-) approximating iff the MacNeille completion Af is a frame, and £ 
is (strongly) approximating iff Af is completely distributive. 

(3) Consider the standard extension V of all directed downsets of a 
dcpo P. Here, the £>-core, respectively, £>-below part of f x is the 
way-below ideal of x. Invariance of V postulates that for any directed 
downset D, the set be again directed. But this is certainly the 
case in any continuous poset, because «fi is a directed union of 
directed downsets, namely the way-below ideals of the elements in D. 
Now, the results in Section 3 confirm that the way-below relation of a 
continuous poset has the interpolation property [18]. As remarked in 
the introduction, V is (strongly) approximating, i.e. P is a continuous 
poset, iff the Scott completion V y is completely distributive. 

Similar arguments apply to any union complete subset system Z [26], 
showing that for such systems the iT-below relation of a Z-continuous 
poset is idempotent and the system of Z - join ideals is completely 
distributive (see [4, 5, 12, 22]). 

For a more comprehensive discussion of local approximation properties 
of the standard extension V, we refer to [13] and [14]. Here we only 
mention the fact that the local approximation property of V entails 
the frame property for the Scott completion, but not conversely! 

(4) For the standard extension of all finitely generated downsets, 
f={|F:FCj], 

the associated preclosure operator A T is idempotent, hence a closure 
operator. It assigns to each subset Y of X the Frink ideal generated 
byY, 
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A F Y = (J{AF : F C W Y}. 

Thus, the corresponding closure system 

F A = {Y CX : AF CY for all F C U F} 

consists of all ideals in the sense of Frink [17]. In that context, the 
results of Section 1 reduce to known characterizations of posets with 
distributive ideal lattices [8]: 

Corollary 13. For any poset P = (X, <), the following conditions 
are equivalent: 

(a) The lattice F A of all Frink ideals is distributive (a frame). 

(b) T a is a sublocale of C, the frame of all downsets. 

(c) The cut operator A preserves finite intersections in T A . 

(d) The ideal operator A T preserves finite intersections in C. 

(e) A T induces a (pre)nucleus on C. 

(f) For F C- u X and x £ A F, there is an E C u fF with x = \J E. 

(g) F is locally approximating. 

A V-semilattice P with these properties is called distributive (see, 
for example, [19]). In [8], an example of a V-semilattice has been 
given whose ideal lattice fails to be distributive, although the sys- 
tem of all downsets (not only the finitely generated ones) is locally 
A-approximating. Of course, that failure cannot happen with A- 
semilattices, because then the finitely generated downsets form a A- 
semilattice, too, and Corollary 1 applies. For the “global” situation, 
we get: 

Corollary 14. The lattice F A of all Frink ideals of a poset P is 
completely distributive iff F is (strongly) approximating, that is, for 
each x £ P there is a least F £ F with x £ A F. In case P is a V- 
semilattice, the latter condition holds iff each element is a finite join 
of V -prime elements, in other words, P is a free semilattice over a 
poset. 

These free semilattices are, up to isomorphism, just the standard ex- 
tensions F of their subposets of V-prime elements, and conversely, the 
standard extension of all finitely generated downsets of an arbitrary 
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poset is “the” free V-semilattice over that poset. Our final exam- 
ple shows that the standard extension T of a V-semilattice may be 
(strongly) approximating without being locally A-approximating. 

Example 4. The “double real square” 

C = [0, 1] x [0, 1] x {0, 1} 

is completely distributive. For any number r with 0 < r < 1, the 
subset 

S = C\ (({r} x [0. 1] x {0}) U ([0. 1] x {r} x {0})) 

is a V-subsemilattice whose elements are joins of finitely many V-prime 
elements. For x = (1, 1, 0), y = (r, 1, 1), z = (1, r, 1) and F — i{y, z}, 
we obtain x <\J F, but ^x f)F is not finitely generated. 
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Galois Connections and Complete 
Sublattices 

K. Denecke, S. L. Wismath* 



Abstract 

Any Galois connection (between two sets) has associated with it two clo- 
sure operators, and hence two complete lattices (dually isomorphic to each 
other). Set-theoretical Galois connections are induced by relations, and 
it is interesting to know which subrelations of an inducing relation will in 
turn induce complete sublattices of the two lattices. Complete sublattices 
of the Galois- lattices may also be obtained using conjugate pairs of closure 
operators. It is also well known that the set of fixed points of a closure (or 
a kernel) operator defined on a complete lattice forms a complete lattice, 
and in this case too we can look for properties which ensure that this lattice 
is a complete sublattice. This paper surveys the results known about such 
subrelations, conjugate pairs of closure operators, and closure and kernel 
operators. We also give an application of the closure and kernel opera- 
tor method, which generalizes several well-known examples from universal 
algebra using the Galois connection (Id, Mod). 

AMS Classification: 06B23,06A15. 

Key Words: Galois connection , Closure operator , Kernel operator , Com- 
plete sublattice. 



1 Introduction and Notation 

A Galois connection is a pair of mappings with certain properties, be- 
tween two sets of objects, usually of different kinds. Such a connection 
can provide a useful tool for studying properties of one kind of object, 
based on the properties of the other (usually more well-known) kind 
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of objects. In the classical Galois theory, for instance, properties of 
permutation groups are used to study field extensions. In universal 
algebra, the Galois connection (Id, Mod) between classes of algebras 
and sets of identities allows us to study algebraic structures either 
algebraically, in terms of the usual operators H, S and P, or equa- 
tionally, in terms of the identities satisfied by a given algebra or class 
of algebras. The Galois connection (Pol, Inv ) between sets of oper- 
ations and sets of relations, on a fixed base set, is also important in 
universal algebra, and combines the structure-theoretic point of view 
with the logical approach. 

For any Galois connection between sets A and B, the corresponding 
collections of closed sets form complete lattices, which are dually iso- 
morphic to each other. We remark that the propositions formulated 
here for sets are also valid for proper classes. For the two examples 
mentioned above, we have the lattices of all varieties and of all equa- 
tional theories of a fixed type, and the lattices of all clones and of all 
relational clones on a fixed base set. Since such lattices are usually 
very large and complex, it is useful to find ways to study pieces of 
them, particularly their complete sublattices. In this paper we give 
a survey of results on three such methods for constructing complete 
sublattices of the lattice of closed sets obtained from a Galois connec- 
tion: conjugate pairs of closure operators, Galois-closed subrelations, 
and closure and kernel operators with particular properties. We ap- 
ply these results to produce a general method for finding complete 
sublattices, and illustrate it with some examples for the connection 
(Id, Mod). This method encompasses several well-known examples 
involving normal and regular identities, and also provides a frame- 
work for the introduction of some new examples. 

In the remainder of this section we fix the main definitions and nota- 
tion to be used throughout. We let A and B be fixed non-empty sets. 
A Galois connection between A and B is a pair (t, /j) of mappings 
between the power sets V(A) and V(B), 

i : V(A) V(B) and /i : V(B) V(A), 

such that for all T, T' C A and all S, S' C B the following conditions 
are satisfied: 



(i) TCT'=> l(T) t(T'), and 5 C S' => n(S) D /i(F'); 




Galois Connections and Complete Sublattices 



213 



(ii) T C fu(T), and S C l/jl(S). 

That is, the two mappings in a Galois connection reverse inclusions, 
(map larger sets to smaller images), and applying the composite of the 
two mappings to a given set produces a set which contains the original 
one. It is well known that for any Galois connection (i, fj) between 
A and B, the composite mappings [u and tfi are closure operators 
on A and B respectively, meaning that they are extensive, monotonic 
and idempotent mappings. The sets closed under fu are precisely the 
sets of the form for some SC B; the sets closed under l/j, are 

precisely the sets of the form l(T), for some T C A. 

In general, for any closure operator 7 defined on a set A the set % 1 of 
all 7-closed subsets of A forms a complete lattice. In this lattice, the 
meet operation, also the greatest lower bound or infimum with respect 
to the partial order of set inclusion, is the operation of intersection. 
The join operation however is not usually just the union: we have 



vs=n{tf ^ 1 h 2U e } 

for every B C % 1 . 

In the special case of a Galois connection between sets A and B, we 
obtain two closure operators fu and tfi, and thus two complete lattices, 
'Hfj.o and of closed sets. These two lattices are dually isomorphic 
to each other. 

A relation between the sets A and B is simply a subset of A x B. Any 
relation R between A and B induces a Galois connection, as follows. 
We can define the mappings 

l:V(A)^V(B), il:V{B)^V{A), 

by 

i{T ) : = {seB I VteT ((t,s)eR)}, 
n(S) : = {t e A | Vs e S ((t, s) e R)}. 

Then it is easy to verify that the pair (t, /i) forms a Galois connec- 
tion between A and B, called the Galois connection induced by R. 
Conversely, any Galois connection between A and B determines a re- 
lation R C Ax B. Thus there is a one-to-one correspondence between 
relations from A to B and Galois connections between A and B. 
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For example, the Galois connection (Id, Mod ) between classes of alge- 
bras and sets of identities corresponds to the relation R of satisfaction: 
for any algebra A and any identity u « v, the pair (A,u Ij'j IS 111 
R iff the algebra A satisfies the identity u « v. Similarly, the Galois 
connection (Pol, Inv) is based on the relation of preservation (of a 
relation by an operation). 

2 Conjugate Pairs of Closure Operators 

Given a relation R from A to B , and the corresponding Galois con- 
nection (l, (i) between A and B, we obtain the complete lattices R^ 
and R LfM of closed subsets of A and B respectively. Our goal in this 
section is to construct, from the given relation R and the induced Ga- 
lois connection, a new relation and Galois connection, which produce 
complete sublattices of R^ and R Lfl . The tool we use is the concept 
of a conjugate pair of closure operators. 

Definition 2.1. A closure operator 7 defined on a set A is said to be 
additive if for all T C A, 7(F) — [J 7(a). (Note that we write 7 (a) 

d^T 

for ida})). 

We can show easily that when 7 is an additive closure operator, the 
least upper bound operation on the lattice R 7 agrees with (J B (see 
[Rei] or [DG]). We always have (J B C ^ifJB) because of the ex- 
tensivity of 7. Conversely, if a 6 (JB then a E B € B for some 
set B £ B, and since B £ R 7 we have 7(a) C (J B and 7((J^) = 
U b( a ) — U U & = U This means that |J B is 7-closed and 
aeUE aelje 

\f B = 1J B. In other words, when 7 is an additive closure operator 
on A, the corresponding closure system forms a complete sublattice of 
the lattice (V(A); A = fj, V = (J) of all subsets of A. 

(That the additive closure operators are in one-to-one correspondence 
with union-closed closure systems (Alexandroff topologies) is well- 
known. In the thirties of the past century Alexandroff dicovered 
a one-to-one correspondence between additive closure operators and 
quasi-orders.) 

Definition 2.2. Let 71 be a closure operator defined on the set A and 
let 72 be a closure operator defined on the set B. Let R C A x B 
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be a relation between A and B. Then 71 and 72 are called conjugate 
with respect to R, and we say that the pair (71, 72) is a conjugate pair 
with respect to R, if for all t 6 A and all s £ -B, 71 (i) x {s} C R iff 
{i} x 72 (s) C R. 

This property of conjugacy of two closure operators is defined in terms 
of individual elements. When the two operators are also additive, we 
can extend this to sets of elements. Thus when (71,72) is a pair of 
additive closure operators, 71 on A and 72 on B, and they are conju- 
gate with respect to a relation R C A x B, then for all T C A and all 
S C B we have T x 72 (S') C R if and only if 71 (T) x S C R. 

The two closure operators pi and tp, obtained from a Galois connec- 
tion induced by a relation R, are always conjugate with respect to R. 
But /. it and ip need not be additive in general. 

Definition 2.3. ([DWB]) Let 7 := (71, 72) be a conjugate pair of ad- 
ditive closure operators, with respect to a relation R C A x B. Let R 1 
be the following relation between A and B: 

R 1 := {(t, s) G A x B | 7x(t) x {s} C R}. 

We now have two relations and Galois connections between A and B. 
We have the original relation R, with induced Galois connection (1, p) 
between A and B , and corresponding lattices of closed sets. We also 
have the new relation and its induced Galois connection, which we 
shall denote by (t 7 ,/r 7 ). The following theorem gives some properties 
relating the two Galois connections. 

Theorem 2.4. ([DWB]) Let 7 = (71,72) be a conjugate pair of addi- 
tive closure operators with respect to R C A x B . Then for allT C A 
and S C B, the following properties hold: 

(i) p-yiT) = /i(7i(T)), 

(ii) p 7 (T) C p(T), 

(Hi) 72 (/g( t )) = ^( t )> 

(iv) 7l (t(^ 7 (r))) = i{Pi(T)), 
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(v) // 7 (t 7 (S')) = p,(i(j2(S))) ; and dually, 

(i ') h (S) = 472 (5)), 

(a ') ^(s) c i(s), 

(in') 71 (i n (S)) = t 7 (S), 

fa V 72(/*M‘S'))) = V(h( s ))> 

(v') i^iT)) = 4M7i(T))). 

The next theorem is the “Main Theorem for Conjugate Pairs of Clo- 
sure Operators”, from [Rei]. It shows that when we consider sets 
which are closed under the original Galois connection from R, there 
are four equivalent conditions for such sets to also be closed under the 
new connection from R 7 . 

Theorem 2.5. (Main Theorem for Conjugate Pairs of Additive Clo- 
sure Operators) Let R be a relation between sets A and B, with cor- 
responding Galois connection (//, t)- Let 7 = (71,72) be a conjugate 
pair of additive closure operators with respect to the relation R. Then 
for all sets T C A with if/jfT)) = T the following propositions (i) - 
(iv) are equivalent; and dually, for all sets S C B with p,(i(S)) = S, 
propositions (i ) - (id ) are equivalent: 

(i) T = t 7 (// 7 (T)), 

(ii) 71 C T)=T, 

(in) fi(T) = /i 7 (T), 

(iv) 7 2 (/i(T)) = n(T); and dually, 

(i ') S = ia y ( h (S)), 
fa ') 72 (S) = S, 

(in ' ) i(S) = i n (S), 

(w') 7l (*(S)) = t (S). 
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Proof: We prove the equivalence of (i),(ii),(iii) and (iv); the equiva- 
lence of the four dual statements can be proved dually. 

(i) =>- (ii) We always have T C 71 (T), since 71 is a closure opera- 
tor. Since ip is a closure operator we also have 7 i(T) C qu( 7i(T)) = 
hithiT)) = T, by 2.4 (V) and (i). 

(ii) =>- (iii) We have /i(T) = ^(71 (T)) = // 7 (T) by (ii) and 2.4 (i). 

(hi) =>- (iv) We have 7 2 (m(T)) = 7 2 (/r 7 (T)) = ^ 7 (T), using 2.4 (iv) 
and (iii). 

(iv) =>• (i) Since the i 7 // 7 -closed sets are exactly the sets of the form 
t 7 ( S' ), we have to find a set A C B with T = t 7 (A). But we have 
t 7 (//(T)) = t(72(//(T))) = i(p(T )) = T, by 2.4 (i') and our assumption 
that T is qu-closed. ■ 



Before we use this Main Theorem to produce our complete sublattices, 
we need the following additional properties. 



Theorem 2.6. Let R be a relation between sets A and B, with Galois 
connection Let 7 = (71,72) be a conjugate pair of additive 

closure operators with respect to R. Then for all sets T C A and 
S C B, the following properties hold: 



(i) 7 ,(T) C i(fi(T)) «■ 
(») 7i(T) S i(p(T)) «■ 
(i ') 12 (S) c M ([(S)) o 



/.(/i(T)) — ; 

71 ( 4 (/i(T))) = t(/i(T)); 
/i(i(5)) = // 7 (t 7 (A)); 
72(/i(i(5'))) =/i(i(5)). 



V 72(5) c p{i(S)) 4* 

Proof: We prove only (i') and (ii'); the others are dual 



(i') Suppose that 72(5) C fi(t(S)). Since fu is a closure operator 
we have p(i(S)) = p(t(p(t(S)))) D p(t(-f 2 {S))) = jU 7 (t 7 (A)), by our 
assumption and by 2.4 (v). Also S C 72(A), and hence we have 
C 2(A))) = // 7 (i 7 (A)), again by 2.4 (v). For the converse 
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we have 72 (5) C //(i(7 2 (S'))) = /i 7 (i 7 (5)) = using the exten- 

sivity of pi, 2.4 (v) and our assumption. 

(ii') Let 72 (S) C (p(i(S)). Then S C 72 (S) implies that 7 2 (//(t(5))) 
C 72(A*(t(72(5')))). We also have 72(1^(72(5)))) = 7 2 (/u(t 7 (5))) by 
Theorem 2.4 (i'), and 7 2 (^(t(72(5)))) = p(i 7 (S)) by 2.4 (iv'). In ad- 
dition, p(i 7 (S)) = h( l {J2 (5))) C //(i(/i(t(5)))) = p(i(S)). Altogether 
we obtain 7 2 (//(t(5))) C The opposite inclusion is always 

true, since 72 is a closure operator. Conversely, 5 C p(i(S)) implies 
72(5) C 7 2 (p(i(S))) = by the extensivity of pi, the mono- 
tonicity of 72 and our assumption. ■ 



Now we have the machinery to produce our complete sublattices. From 
the original relation i? and Galois connection (/i, 1) we have two (du- 
ally isomorphic) complete lattices of closed sets, the lattices R^ and 
R tfJi . We also have two complete lattices of closed sets from the new 
Galois connection (i 7 , p 7 ) induced by R 1 . Our result is that each new 
complete lattice is in fact a complete sublattice of the corresponding 
original complete lattice. 

Theorem 2.7. Let R be a relation from A to B, with induced Galois 
connection (/a, i)- Let 7 = (71,72) be a conjugate pair of additive 
closure operators with respect to R. Then the lattice R^ Ll of sets 
closed under p 7 t 7 is a complete sublattice of the lattice R 0i , and dually 
the lattice R h01 is a complete sublattice of the lattice R ltl . 

The proof can be found in [Rei] . 

Conjugate pairs of additive closure operators are an important way to 
construct complete sublattices of given closure lattices. We may also 
consider the set of all such conjugate pairs, and define on it an order 
relation, as follows: for a = (71,71) and j 3 = (72,72) we set 

a<fl:^(VT C A)(V5 C B)[ 7 l (T) C 7a (T) and 7 ((5) C 7' (5)]. 

When a < ( 3 , it can be shown that the lattice R 00L0 is a sublattice of 
Rn ala , and dually that R i0ll0 is a sublattice of Rt alia - The following 
additional properties may also be verified: 

(i) 7 i(‘(/j(T))) = 4 m(T)) <=► T = and 
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(i ') 72 {MS))) = M s )) ^S = Ml 2{T))). 



The theory of conjugate pairs of additive closure operators has been 
generalized in [Arw] to the situation of conjugate pairs of extensive, 
additive operators. 

3 Galois-Closed Subrelations 

In the previous section we used a pair of additive closure operators, 
which is conjugate with respect to the relation R, to determine a new 
relation R 7 , which in turn induces a new Galois connection and closure 
operators. We showed that the sets closed under these new operators 
form complete sublattices of the original lattices of closed sets. Now 
we want to focus on the relations such as R 7 which determine com- 
plete sublattices of our original lattices. In general, we can look at 
subrelations R! C R , and consider the complete lattices of closed sets 
obtained from the Galois connection induced by R'. When are these 
lattices complete sublattices of the lattices obtained via R1 In this 
section we describe a property of subrelations R' of R which is suf- 
ficient to guarantee that the new complete lattices obtained from R! 
will be complete sublattices of the original lattices, and show that any 
complete sublattices of our original lattices arise in this way. The 
property we need is called the Galois-closed subrelation property. 

Definition 3.1. Let R and R! be relations between sets A and B, 
and let (/q i) and (//, d) be the Galois connections between A and B 
induced by R and R' , respectively. The relation R' is called a Galois- 
closed subrelation of R if: 

1) R' C R, and 

2) for all T C A and all S C B, we have (n'{T) = S and d(S) = 
T => n(T) = S andt{S)=T). 

The following equivalent characterizations of Galois-closed subrela- 
tions were given in [GW], [ADP] and [Den]. 



Proposition 3.2. Let R' C R be relations between sets A and B. 
Then the following are equivalent: 
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(i) R 1 is a Galois-closed subrelation of R; 

(ii) For any T C A, if i'p![T) = T then fi(T) = p!{T), and for any 
SCB, if n't,' (S) = S then l{S) = i'{S); 

(in) For allT C A and for all S C B the equations dp,'(T) = ipt'(T) 
and p'i'(S) = yu'{S) are satisfied. 

The proof is straightforward and will be skipped here. 

It is also straightforward to show that if 7 := (71,72) is a pair of ad- 
ditive closure operators which are conjugate with respect to a relation 
R C A x B, then the relation of Definition 2.3 is a Galois-closed 
subrelation of R. 

The significance of Galois-closed subrelations lies in the fact that there 
is a one-to-one correspondence between Galois-closed subrelations of 
a relation R C A x B and complete sublattices of the corresponding 
lattices % Lfl and of closed sets. The remainder of this section is 
devoted to a scetch of the proof of this claim. Specifically, we will 
show that any Galois-closed subrelation R' of the relation R yields 
a lattice of closed subsets of A which is a complete sublattice of the 
corresponding lattice FL ifl for R, and conversely that any complete 
sublattice of the lattice occurs as the lattice of closed sets induced 
from some Galois-closed subrelation of R. Dual results of course hold 
for the set B. 

Theorem 3.3. ([GW], [Den], and [ADP]) Let R C A x B be a rela- 
tion between sets A and B, with induced Galois connection (//, t). Let 
PL tfi be the corresponding lattice of closed subsets of A. 

(i) Lf R ! C A x B is a Galois-closed subrelation of R, then the class 
Uri := Hi' n' is a complete sublattice ofR i(Ji . 

(ii) IfU is a complete sublattice ofR LjJl , then the relation 

R u := U {Tx p(T) | TeU} 
is a Galois-closed subrelation of R. 
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(Hi) For any Galois-closed, subrelation R' of R and any complete sub- 
lattice U ofFL ifu we have 

Uru = U and R Unz , = R'. 

Scetch of the Proof: (i) We begin by verifying that any subset of 
A which is closed under the operator dp' is also closed under ip, so 
that the lattice Rrp is at least a subset of R Lil . Let T e so that 
dp'(T) = T. By Proposition 3.2, parts (ii) and (iii), we have 

ip(T) = l p'(T) = dp'(T) = T. 

Therefore, Rj^ C R LfJl . 

Next we have to show that this subset is in fact a sublattice. This 
means showing that for any family {Tj \ j G J} of sets in Rrp, both 
the sets f\ {Tj \ j 6 J} and \/ {Tj \ j e J} are in 

W-Lp, H-L (X 

We know from part (i) that the collection {Tj \ j £ J} is also a family 
of sets in Since 

j£J 

it is enough to show that 

By repeated use of Lemma 1.1 and Proposition 3.2, we check that 
jeJ jeJ 

This shows that i p( (J Tj) is also a fixed point under dp,', so that it 
ieJ 

too is an element of . 

(ii) Now let U be any complete sublattice of We will prove that 
the relation Ru is a Galois-closed subrelation of R. First, for each 
non-empty T £ U we have p(T) = {s G B \ V t G T, (t, s) € R} and 
then T x p(T) C R. Therefore Ru G R. To show that the second 
condition of the definition of a Galois-closed subrelation is met, we 
let (//, d) be the Galois connection between sets A and B induced by 
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Ru, and assume that fj/(T) = S and i'(S) = T for some T C A and 
S C B. Our goal is to prove that 

fi(T) = S and 4(5) = T. (*) 

For each t E T, we define 

D t = P| {T' E U | t E T' and S C //(T')}. 

The following facts can be verified: 

(a) /x'({t}) = n'(D t ). 

(b) ({*}) = D t . 

(c) T = U D t . 

teT 

(d) n(T) = n'{T) = S and t(S) = t'(S) = T, and (*) holds. 

From the fact that for any set T EU, if //(T) = S, then t(S) = 
we have i(S) = t'(S) = i' //(T) = T. This shows that (*) holds, com- 
pleting the proof of (ii) that Ru is a Galois-closed subrelation of R. 

(iii) Now we must show that for any complete sublattice U of and 
any Galois-closed subrelation R' of R, we have Ur u = U and Ru , = R' . 

We know that Ur u := %,///> the lattice of subsets of A closed under 
the closure operator </// induced from the relation Ry. This means 
that T £ Ur u iff i'n'{T ) = T. First let T £ Ur u , and let S be the 
set fJ> r (T). Then we have t'(5) = T, and since Ry is a Galois-closed 
subrelation of R we conclude that 

fi(T) = S and 4(5) = T. 

If T = 0 then T EU, and for T ^ 0 we can show that T = (J D t (see 

teT 

(c) above). But now T = l //(T) = D t ) = (J {D t \ t E T} EU. 

teT 

This shows one direction, that Ur u C U. 

For the opposite inclusion, let T Eli. Then using the fact that U is a 
sublattice we have 

4 V(T) = 4 V(T) = 4/r(T) = T. 
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This shows that T £ which is equal to Ur u . We now have the 

required equality Ur u =U. 

Now let R 1 be a Galois-closed subrelation of R, and set 

U w := = {T C A | iV'(T) = T}, and 

Ru r , '■= U{T x //(T) | T £ Uri}. 

We will show that Ru r , = R! ■ 

First, if (t,s) £ R! then s £ p'({t}). Setting S := p'({t}), we have 
s £ S and l'(S) = i'p'({t}). Now taking T := i'p!{{t}), we have 
i’{j!(T) = T, so T £ U R and //(T) = 5 and d(S) = T. Therefore 
//(T) = S and l(S) = T. 

Since t £ i'[//({t}) = T and s £ S = //(T), we get (t, s) £ T x //(T) 
and T £ Uri . Hence (f , s) £ , and we have shown that R! C r u rI - 

To show the opposite inclusion, let T £ Uri, and let S = n{T). Then 
from Facts 1 and 2 we have 

//(T) = fi(T) = S and i'(S) = i(S) = l / i(T) = T. 

Therefore T x n(T) C R! , and Ru r , G R! . Altogether, we have Ru r , = 
R! . This completes the proof of part (iii), and of our theorem. ■ 

4 Closure and Kernel Operators 

It is well known that there is a 1- 1-correspondence between closure 
operators (p : C — >■ C on a complete lattice C and closure systems S 
on C (that is, subsets of £ closed under arbitrary meets). This cor- 
respondence is given by the following maps. For any closure operator 
cp :£—>■£, we get the closure system 

5 := Fix(<p) := = {T £ £ \ p(T) =T} 

of all fixed points of <p\ and for any closure system S we have the 
closure operator p defined by 

p(T) = <p s (T) := A {T' £ S \ V > T} for T £ £. 
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Moreover, for any closure system S and any closure operator p, we 
have = S and p nip = p. 

There is also a dual 1- 1-correspondence between kernel operators and 
kernel systems on a complete lattice £. A kernel operator is a mapping 
which is intensive, monotonic and idempotent, and a kernel system on 
£ is a family S of subsets of £ which is closed under arbitrary joins. 
Then for kernel operators ip : £ — >■ £ on £ and kernel systems S on 
£, we set 

S := Fix (ip) := {T e £ | ip(T) = T} ; 
c 

ip(T) := ip s (T) := \J{T £ S \ T < T} for T G £. 

We have Fix (ip 5 ) = S and 'tpFix{i>)= ^ f° r an Y kernel system S and 
any kernel operator ip on £. 

A result of A. Tarski ([Tar]) shows that for any closure operator p on 
a complete lattice £, the closure system (fixed-point set) is always 
a complete lattice with respect to <. However, it is not necessarily 
a sublattice of £. Thus we look for some additional condition under 
which a complete sublattice is obtained. The answer (and its dual for 
kernel operators) is given in the following well-known theorem ([Tar]). 

Theorem 4.1. Let £ be a complete lattice. 

(i) If p is a closure operator on £ which satisfies 

c c 

■rtVW I i e = VteW) I i 6 W 

for every index set J, then the set of all fixed points under p, 

= {T £ £ I p(T) = T} 

is a complete sublattice of £ and p{£) = H v - 

(ii) Conversely, if H is a complete sublattice of £, then the function 
P'n which is defined by 

c 

Pn(T) := /\{T' £%\T < T'} 
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is a closure operator on £ with <p-u(C) = %, and satisfies the con- 
dition (*). Moreover, %<?■». = ^ an< ^ = <£• 

(iii) If ip is a kernel operator on £ which satisfies 

c c 

Mra i * e = Aww) 3 6 J > («) 

for every index set J , then the set of all fixed points under ip, 

?fip = {T e £ \ ip(T) = T} 
is a complete sublattice of £ and ip(£) = 

(iv) Conversely, if % is a complete sublattice of £ then the function 
ipu which is defined by 

c 

4>h{T) := \f{T' en\r <T} 

is a kernel operator on £ with ipu(£) = kip, and ip satisfies the con- 
dition (**). Moreover, ktp n = % and ip H ^ = ip. 

5 An Application 

We saw in the previous section that any join-preserving closure opera- 
tor on a complete lattice induces a complete sublattice of the lattice. In 
this section we describe a technique to produce such a join-preserving 
closure operator. Our technique is a relatively simple one, but when 
applied to the Galois connection (Id, Mod) leads to some interesting 
and new examples of complete sublattices of the lattice of all varieties. 
As before, we assume that (t, p) is a Galois connection between two 
sets A and B. Let E C B be a subset of B which is closed under 
the closure operator ip, so that ip(E) = E. We define the following 
functions. 

Definition 5.1. Let <pg : V(B) — > V(B) be defined by S ^ (S) = 

S fl E, for every S C B. 

Let ipv : V(A) — > V(A) be defined by T ^ <Pe(T) = p(ip^(i(T))) , for 
every T C A. 
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Then the following proposition is easy to prove. 

Proposition 5.2. (i) The operator p^ is a kernel operator on B which 
preserves arbitrary unions and non-empty intersections, so that for 
any indexed family (Sj)j<=j of subsets of B, we have 

>' " f r l I j € J}) = n { Pf iS/t j r J J ant l 

<pf(U{Sj I j 6 J}) = u {v>f(S,) \j € J}. 

(ii) The operator pf is a closure operator on the lattice % lu of the 
subsets of A closed under the closure operator pi, and it preserves 
non-empty joins. 

As a consequence of this Proposition and Theorem 4.1, we have the 
following result. 

Theorem 5.3. Let E C B be a subset of B closed under the closure 
operator ip, and let Pe : V(A) — > V(A) be the closure operator de- 
fined by Pe{T) = p(p% (i(T))), where Pe( l ( t )) = H E for every 
T e Tt^. Then the set of all p^-closed subsets of A forms a complete 
sublattice of the lattice TL^. m 



The next proposition lists some additional properties of the operator 
Pe which we shall need. 

Proposition 5.4. (i) i(p(p^(S))) C Pe(l(/j,(S))) , and if S is a subset 
of B for which t(p(S)) = S, then also l(p{pe(S))) = Pe{^{p{S))). 
(ii) For all sets S and S' C B, we have i(p(p^(Lp(S) U ip(S')))) C 

fiMS u s')). 

Now let T be a subset of A which is closed under the operator pi, so 
that T € FL/di.. It is clear that the collection of all subsets of T which 
are closed under pi forms a sublattice C(T) of FL^. We will denote by 
C(pe(T)) the lattice of all subsets of Pe(T) which are closed under 
pi. Note that the set Pe{T) is itself such a closed set. Then we get 
the following result. 

Theorem 5.5. Let T e TL be a set such that Pe(T) ^ T. Assume 
that the operator p f satisfies the inclusion 




Galois Connections and Complete Sublattices 



227 



(*)tp%(L(i(SuS')) C ^M5)U#'))))J C .r5= t ([/ 1 ) and S' = 
i(U 2 ) with U\ and U 2 G £(T). Then the mapping A : £(T) — > £(T) 
defined by U ^ Te(U) for all U C T is a lattice homomorphism. 

Proof: It follows from Proposition 5.2 that the map ip A preserves 
joins, and we have to prove now that it also preserves meets. We have 

^(E/i A u 2 ) = p£(Ui A u 2 ) = A u 2 ))) = u 

I'(U 2 )))) = U <Pe(w(i(U 2 )))), by (*) and Proposi- 
tion 5.4(ii). Since L(<pg(Ui) = (t(f/j)), for i = 1,2, we see that 

V&U! A U 2 ) = U <P%{i>MU 2 ))))) 

= u ^(t(u 2 ))) 

= I*(wM<Pe(U 1)) u ‘(VeM)))) 

= <pi(U 1 )Afi(U 2 ). m 



As our source of examples we now consider the following situation. 
Let t be any type of algebras, with W T {X) the set of all terms of 
type r on a countably infinite alphabet X. We take A to be the class 
Alg{r) of all algebras of type r, and B to be the set W r (X) 2 of all 
equations of type r. We use the maps t = Id and /i = Mod, where 
for any K C Alg(r) and any £ C W T (X) 2 

Id(K) = {u « v G W T (X) 2 | A satisfies u « v for all A G K}, 
Mod(Yf) = {A G Alg(r) \ A satisfies u ~ v for all u ~ v G £}. 

It is well known that these two maps form a Galois connection between 
Alg(r) and W T (X) 2 , induced by the relation of satisfaction between 
algebras and identities. The corresponding complete lattices are the 
lattice £(r) of all varieties of type r, and the lattice £{r) of all equa- 
tional theories of type r. 

Now we take E = E(r) to be any property of identities which is an 
equational theory, that is, any element E G £(t). The map with 
£ 1 — y HOE is then a kernel operator which preserves meets. By Theo- 
rem 5.3, the mapping = Modp^Id is a closure operator on 

the lattice £(r), and preserves joins. Then the collection of ^-closed 
varieties forms a complete sublattice of £(r). This corresponds to the 
Galois connection induced by the Galois-closed subrelation 

Re = {(A, s « t) G Alg{r) x E \ A satisfies s « t}. 

Corollary 5.6. Let E(r) be any property of identities which forms an 
equational theory. Then the class of all p^-closed varieties of type r 
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forms a complete sublattice of the lattice C{t) of all varieties of type 

T. 

Two well-known examples of this corollary arise from taking E = E(r) 
to be either the set N(r) of all normal identities of type t, or the set 
Reg(r ) of all regular identities of type r. Other kinds of identities 
which have been used in this way are the externally compatible or 
^-compatible identities. These various kinds of identities have been 
studied by E. Graczynska, I. I. Mel’nik and J. Plonka (see [Gra], [Mel] 
and [Plo]). 

in [DW] we will introduce a countably infinite family of new such 
choices for E, based on the idea of complexity of terms and identities, 
and apply our theory to this new setting. This will allow us to general- 
ize the notions of normal varieties and normalization of a non-normal 
variety, to ^-normalizations, for each natural number k. 
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Abstract 

This paper reports on various Galois connections between operations and 
relations. Several specifications and generalizations are discussed. 

AMS Mathematics Subject Classification: 08 A 55 , 03B10. 

Key words: Operations , Relations, Clones, Relational algebras. 



Introduction 

Operational systems and relational systems are basic structures in 
algebra and logic. Here we focus on one fundamental connection be- 
tween operations and relations which is based on the invariance rela- 
tion “an operation / preserves a relation p”. This induces a Galois 
connection 1 (Pol — Inv, see 1.3, 2.2) between sets of operations and 
sets of relations. Many well-known algebraic concepts fit into this 
framework (automorphisms, subalgebras, congruences, see e.g. proof 
of 3.5). 

There are numerous results on various Galois connections between 
operations and relations characterizing the Galois closed sets. In this 
paper we undertake the attempt to collect and systematize these re- 
sults and to give a kind of survey. Therefore almost no proofs are 
given and besides new formulations there are no new results. More- 
over, we were not able to give a complete survey. We mainly report on 

1 In [McKMT87, p. 147] this Galois connection is emphasized as the most basic 
Galois connection in algebra. In [Wil03] it is mentioned as a general framwork for 
the development of dyadic algebra. 
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results which can be found in the following (not so easily accessible) 
papers: [Pos80a] (some of the results of this report can also be found 
in [Pos79]), [Pos84], [BorOO] and [BorPS]. 

For an orientation to which Galois connections are discussed the reader 
may at first consult the Table 1 in Section 3 and Table 2 in Section 4. 



1 Basic notions and notations 



1.1. Operations and Relations. For operations / : A n — » A and 

relations g C A m (m, n £ N + = {1, 2, 3, . . . }) on a fixed base set A we 
introduce the following notation: 



Op ln >(A) 


:= {f\f:A n ^A} 


(n-ary operations) 


Op(A) 


■■= U“=1 Op (n) (^4) 


(finitary 

operations) 


1 'r(A) 


:= Op (1) (A) 


(transformations, 
i.e. unary 
permutations) 


Sym(/1) 


:= {/ £ Tr(A) | / bijective} 


(permutations) 


Rel (m) (yl) 


:= {g\g^A m } 


(m - ary relations) 


R*1(X) 


:= Um=i Rel (m) (A) 


(finitary relations) 


VW 


:= Rel (1) (A) 


(power set) 


Eq(A) 


:= {0 £ Rel^(A) | 9 equivalence} 


(equivalence 

relations) 



The {A) may be omitted if it is clear from the context. 0-ary functions 
or relations are not considered (mainly for technical reasons). 



An m-tuple r £ A m sometimes will be regarded as a mapping r : 
m — > A with m := {1, . . . , m}, and its components are given by r = 
(r(l),...,r(m)). 

1.2. Invariance. Operations and relations can be connected by the 
invariance property: A relation g £ Rel^(A) is invariant for an 
operation / £ Op^^A) (we also say, / preserves g, or / is a polymor- 
phism of g, notation [Wil03] : / > g) if f[g, g] C g , i.e., if for all 
n, ■ ■ ■ , r n £ g we have f[r u ...,r n ]eg. Here the m-tuple f[r x , ...,r n ] 
is defined component-wise by 



fVu ■ ■ -An]{i) := ■ ■ -,r n (i)) (i £ m). 
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Remark: In the universal algebra setting this means that g is (the 
carrier of) a subalgebra of the algebra (A m , f) m (m-th direct power); 
from the relational point of view / is a (relational) homomorphism 
from the relational system (A; g) n = ( A n ; g n ) into (A; g). 

For permutations / G Sym(A) we define: f strongly preserves g if 
f[g] = g, i.e., if the mapping g — >• g : r i — > f[r] is bijective. 



1.3. The operators Pol — Inv. For F C Op(A), G C Sym(A) and 
Q C Rel(A) we define 



PoUQ 
poiy 1 q 

End^ Q 
wAut^ Q 
AutyiQ 

Inv,4 F 
Inv? 0 F 
Sub^F 
Con^ F 
slnv^ G 



= {/ G Op (A) | every g G Q is invariant for /} , 

= Op (n) (A) nPoUQ, (n G N) 

= Tr(A)nPoUQ (endomorphisms), 

= Sym(A) fl PoU Q (weak automorphisms), 

= {/ G wAutyi Q | f~ l G wAut^ Q} 
(automorphisms) , 

= {p G Rel(A) | g is invariant for every / G F} , 
= Rel (m) (A) fl Inv F {me N) , 

= Inv^ F (subalgebras), 

= Eqfllnv^F (congruence relations), 

= {p G Rel(A) | 

g is strongly invariant for every g e G} . 



Note that the notation Pol stands for polymorphisms, not for polyno- 
mials! The subscript A may be omitted if A is clear from the context. 
The notion of weak automorphism applies here to relational systems 
and differs from the weak automorphisms for algebras introduced by 
A. Goetz ([Goe66]) and investigated in many papers of K. Gtazek 
(e.g. [Gla97]). 

1.4. Clones. Recall, a clone is a set of operations (on a fixed base 
set A) which is closed with respect to composition and contains all 
projections e" G Op^ given by ef(xi, . . . ,x n ) = x*. The composition 
f[gi . . . , g n \ of / G Op<"> and gi, ... ,g n £ Op^ is the m- ary function 
defined by 



f\dl • • ■ ? 9n] (^1 j • • • i %m) • f {dl (-A j • • • > *^m) i ■ ■ ■ > 9n (*A j • • • t ^m)) ■ 
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The least clone containing F C Op (A) (i.e. , the clone generated by F ) 
will be denoted by (F) or (F)o p (a)- Likewise, (-f^)iY(A) and (G)sym(A) 
denote the submonoid of Tr(A) generated by H C Tr(A) and the 
subgroup of Sym(A) generated by G C Sym(A), respectively. 

1.5. Relational algebras (clones). We define the following (set- 
theoretical) operations: 

(1) A a (nullary operation: to contain the diagonal (or equality) 
relation A^ := {(a, a) \ a E A}), 

(2) fl ( intersection of relations of the same arity), 

(3) x ( product for m-ary g and s-ary a\ 

P x o :={(ai, • • • , a m , 6i, . . . , b s ) G A m+S \ (a u . . . , a m ) E g, 

(b u ...,b s ) e cr}), 

(4) pr 7 ( projection onto a subset I of coordinates: For m-ary g and 

I = {ii, ... ,it} with 1 < i\ < • ■ • < it < m we define 

pr i(g) := {(an, ■ • • ,a it ) | 3 a^j e {1, . . . , m) \ I) : 

(ai, • • • j a m ) G p} ), 

(5) tv a ( permutation of coordinates : For m-ary g and a permutation 
a of {1, ... , m} let 

^a(p) - = {(a Q (i), • • • , &a{m)) \ (ai, ■ ■ ■ , Om) G: p} ), 

(6) U ( union of relations of the same arity), 

(7) -i ( complementation : ->p := A m \ g for m-ary p) 

A set Q C Rel(A) of relations is called a relational algebra, weak 
Krasner algebra or Krasner algebra, respectively, if Q is closed with 
respect to (l)-(5), (l)-(6) or (l)-(7), respectively. These closures will 
be denoted by [Q]ra, [Q]wka and [Q]ka, resp. For finite sets A these 
algebras are also called clones (relational clone, (weak) Krasner clone). 
For infinite A see 2.4. 

Note that relational algebras contain relations of arbitrary (finite) ar- 
ity and are much more general than Tarski’s relation algebras ([Tar41]). 
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1.6. Remark. Relational algebras can be characterized also via clo- 
sure with respect to first order formulas. To each first order formula 
p(Ri , . . • , Rq, xi , . . . , x m ) with free variables xi,...,x m and m*- ary re- 
lation (predicate) symbols Ri (i G {1, . . . , <?}) one can assign an oper- 
ation (called logical operation) 

F v : Rel (mi) (^4) x • • • x Rel (m9) (H) ->• Rel (ra) (H) 

according to 

F (p (g 1 ,...,g q ) := {(ai,...,a m ) e A m \ \= (p(g u . . . , g q ; a u ■ ■ • , a m )} . 

Hereby, for atomic formulas e.g. Ri(x,y ) we interpret |= gi(a,b ) as 
(a, b) G gi (for elements a,b G A). E.g., the formula 

(p(Ri, R 2 ;xi,x 2 ) := 3z : Ri(xi,z) AR 2 (z,x 2 ) 

(with binary relation symbols) induces the operation relational prod- 
uct : 



F v {g\, £2) = {(<R, o 2 ) | Bz G A : (a u z) G gi A (z, a 2 ) G g 2 } = gi o g 2 
for binary relations g 2 , g 2 G Rel^(H). 

Let $(3, A, . . . ) denote the set of all first order formulas that contain 
only the indicated quantifier 3, the indicated connectives A, . . . and 
relation symbols and variables. 

Let Lop A (3, A, . . . ) := {F v \ ip G $(3,A, ...)} denote the corre- 
sponding logical operations. Then we have (cf. [PosK79, 2.1.3]) for 
Q C Rel(H): 

Q relational algebra -<=> Q closed w.r.t. Lop A (3, A, =) , 

Q weak Krasner algebra Q closed w.r.t. Lop A (3, A, V, =) , 

Q Krasner algebra Q closed w.r.t. Lop A (3, A, V, - 1 , =) . 



2 Galois connections and Galois closures 

2.1. Galois connections. We use here the canonical definition of a 
Galois connection between sets A and B, namely as a pair of mappings 
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<p : <P (A) — >■ fp (B), ip : *P ( B ) — > *P(A) induced by a binary relation 
/ C A x B via 



<p(X) \= {b £ B \ Va £ X \ (a,b) £ 1} , 
ip{Y) := {a £ A | V6 <E Y : (a,b) £ I } 

for X C A, Y C B. It is well-known that such pairs (<p, ijj) of map- 
pings can be characterized by the following properties: Both ip and ip 
are anti-monotone (i.e. inclusion-reversing) and their compositions pip 
and ipp are closure operators (i.e., extensive, monotone, idempotent). 
Therefore elements X £ *#{A) and Y £ *#{B) satisfying X = ip (ip(X)) 
and Y = ip(ip(Y)) are called Galois closed w.r.t. (p,ip). The sets of 
Galois closed elements are 

{ip(Y) \ YCB}C <$(A) and {<p(X) \ X C A} C «P(B) , 

and they form dually isomorphic complete lattices (w.r.t. inclusion). 

2.2. With the above definitions in 2.1, the (strong) invariance relation 
> (cf. 1.2) gives rise to the following Galois connections: 

Pol — Inv between Rel(A) and Op(^4) , 

End — Inv between Rel(A) and Tr(^4) , 
wAut — Inv between Rel(A) and Sym(A) , 

Aut — slnv between Rel(A) and Sym(A) . 

Whenever a Galois connection is given there arises the question of 
how to characterize the corresponding closure operators and the Galois 
closed sets. This is well-known for these listed Galois connections (cf. 
e.g. [PosK79]) and we summarize the results for finite A in the next 
theorem (for notation see 1.4, 1.5 and 1.6). 

Theorem 2.3. Let A be finite and let F C Op(A), H C Tr(A) ; 
G C Sym(A), Q C Rel(A). Then 

(F)op(A) = Pol Inv F , [<3]ra = Inv Pol Q , 

(#)ti-(A) = End Inv F , [<3]wka = Inv End Q , 

(G)sym(A) = Aut Inv G , [Q]ka = Inv Aut Q . 
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In order to include results for infinite sets A we still need some more 
notions and notations. 

— ^ 

2.4. Relational clones and local closures. For b = (ci, , c m ) G 
B m and a mapping / G A B let 

f[b] := (/(ci),...,/(c m )). 

For a family (ft)j e / of relations ft G Rel^ mi ^(A) (/ being an arbitrary 
index set) we introduce the general superposition with respect to given 
b G B m , bi G B m * (i G I) for some set B as follows: 

S (ijSi)^i)iei ■= {/P] G A m | / G A B and V* G I : /ft] G ft}. 

The strong superposition w.r.t. a G A m , Si G A mi is given by 

sS ( a,ai)(ftW := {/[«] I / e Tr(T) surjective and Vr G I : /[a*] G ft}. 

Let Q C Rel(A). The least set of relations which contains Q and the 
trivial (unary) relations 0, A, and is closed w.r.t. general superposition, 
will be denoted by [Q](sp 

Note that arbitrary intersections are a particular case of general su- 
perposition: flier Qi = S (6,6i)(^)*e/ for b = bi G A m (mi = m, i G I). 
Further we introduce local closure operators which are in fact a kind of 
interpolation operation (and can also be characterized topologically). 
Let F C Op(yl), G C Sym(A), Q C Rel(A), m G N + . Then 

m-Loc F := {/ G Op (ri) f4) | Voi, . . . , S n G A m 3g G F : 

f[ai,...,a n \ = g[Si,...,S n \}, 

OO 

LocF:= m-Loc F, 

m = 1 

Loc 0 (j := {/ G Sym(A) | Vn G N + Vo G A n 3g G G : f[S\ = g[S]}, 
ra-LOC Q := {g E Rel(A) | Vri, . . . ,r m e g3a e Q : 

C a C 0 } , 

OO 

LOC F := P| m-LOC F. 

m=l 

Let [Q] rc := LOC[Q](s) and [Q]wkc := l-LOC[Q]( S ). We call Q C 
Rel(A) a relational clone ( weak Krasner clone, resp.) if Q = [Q]rc 
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( Q = [Q]wkCi resp.). It should be noted that the definition of rela- 
tional clone here differs from that in other papers (e.g. [Pos80a]) where 
sets Q with Q = [Q](s) were also called relational clones. 

One easily checks that 1-LOC Q is the closure of Q with respect to 
arbitrary unions. 

Further, let [Q]kc be the closure of Q with respect to [.]wkc (weak 
Krasner clone), complementation (->) (cf. 1.5) and strong superposi- 
tion (symbolically we express this by (KC) = (WKC)&(-i)&;(sS)). Sets 
Q with Q = [Q]kq are called Krasner clones. 

A weak Krasner clone will be called a Pre-Krasner clone if it is closed 
with respect to strong superposition and contains the inequaltity re- 
lation u = va := {(x,y) € A 2 | x / y} . The corresponding 
closure is denoted by [Q]pkc (thus we have symbolically (PKC) = 
(WKC)&(i/)&(sS)). 

Remark: In [BorOO] it is shown that (Pre-)Krasner clones are just the 
(Pre-)Krasner algebras (cf. 1.5) which are closed w.r.t. strong super- 
position and arbitrary (i.e. also infinite) intersections and unions. If 
A is countable then one can even drop the closure w.r.t. strong super- 
position. 

With these notions we can state the following theorem which ex- 
tends 2.3 to infinite sets A. 

Theorem 2.5. Let F C Op(A), H C Tr(A), G C Sym(A), Q C 
Rel(A). Then 



Loc{F}op(A) = Pol Inv F , 
Loc(ff)Tr(A) = End Inv F , 
Loc 0 (G')tt(a) = wAutlnvG, 
Loc 0 (G) sym(A) = Aut slnv G , 



[Q]rc = Inv Pol Q , 
[Q]wkc = Inv End Q , 
[Q]pkc = Inv wAut Q , 
[Q]kc = slnv Aut Q . 



Remark: We have slnv G = Inv G whenever G < Sym(A) is a permu- 
tation group. Therefore e.g. slnv Aut Q = Inv Aut Q. 

In the next sections we report on further Galois connections which 
appear in connection with operations and relations. There are several 
(mutually connected) possibilities for specialization and generalization 
of those Galois connections considered in the previous section, e.g.: 
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(A) Restricting the operations and/or relations under consideration 
but keeping the invariance relation “p is invariant for /” . 

(B) Generalizing the operations (e.g. to partial operations or multi- 
operations) and/or relations with “canonically” modified invari- 
ance relation. 

(C) Considering “natural” closure operators on operations and/or 
relations and trying to characterize the closed sets as Galois 
closures of a suitable Galois connection. 



3 Galois connections with restricted operations and 
relations 

We start with the approach (A) which was already described in [Pos80a, 
15.1]. 

3.1. The Galois connections B Pol— K Inv. Let E C Op(A) and 
R C Rel(A). Then ^Pol — ^Inv defined by 

E Pol Q \= E n Pol Q for Q C R, 

B Inv F := R fl Inv F for F C E 

is a Galois connection between R and E. For motivation, E and R 
may be regarded as those sets of operations and relations which are 
of particular interest or especially important in some context. 

For arbitrary E and R, there is no general proceedure for how to 
characterize the Galois closed sets of operations or relations. However, 
if we assume that E is a locally closed clone and R is a relational clone 
then (by 2.5) there exist F 0 C Op (A) and Qo G Rel(A) such that 

E — Pol Qo and R = Inv Fo . 

Then we have the following characterization for the Galois closures. 

Theorem 3.2. Let E, R, F 0 ,Qo be as above. Then 

E Pol R lnv F = E n Loc(F 0 U F) 0p{A) for F C Op (A) , 

^Inv E Pol Q = R G LOCfQo U <%) = R 0 [Q 0 U Q] RC for Q C Rel(A). 
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For finite A the local operators can be omitted and [.]rc can be replaced 
by [•] ra • For E = Op(A) or R = Rel(^4) one can choose Rq = 0 or 
Fq = 0 . 

Proof. The result immediately follows from Theorem 2.5 and E D 
Pol Q = Pol Qo fl Pol Q = Pol(Qo UQ), Rn Inv F = Inv F 0 fi Inv F = 
Inv(F 0 UF). □ 

Remark 3.3. Let G := Poli? = Pol Inv Fo (then R = InvG). Every 
pair (E, R) as above gives rise to the induced closure operator 

els ( E ,g)(H) := E Pol K Inv H 

on the principle ideal 4 ,E =f CA E := {H \ H E La and H C E } in the 
lattice La of all clones in Op(yl). In case of finite A we have 

c\s {e , G )(H) = EA(GvH) (see Fig. 1) 

(A, V denoting meet and join in the lattice La) according to Theo- 
rem 3.2 (where the local operators can be omitted since A in finite); 
for infinite A one has to restrict to the lattice of locally closed clones. 




Fig. 1: The closure c\S(e,g){H) = E A (G V H) in \.E 
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Let E — A Ak and let G be the clone of all term operations of an algebra 
A = (A; ( fi)iei ) from a class /C of algebras. The cls( E;G )-closed sets 
H C A Ak (monoids in case k = 1) can be equipped with the algebraic 
structure inherited from A. They were investigated in [JakMP03] as 
so-called operation /C-algebras (transformation /C-algebras for k = 1) 
and generalize the concept of near-rings. 

In particular cases the Galois closures may be characterized more 
intrinsically. Note, e.g., that the Galois connections End — Inv and 
wAut — Inv coincide with E Pol — Inv for E = Tr(M) and E = Sym(A), 
respectively (and trivial R = Rel(A)). 

As a further example we mention the following characterization theo- 
rem where the arity of operations or relations will be restricted (E = 
Op (m) (A) or R = Rel {m) (A)). 

Theorem 3.4. Let m € N + , F C Op(A), Q C Rel(A). Then 

PolInv (m) F = m-Loc(F) 0p (A) , 

InvPol (m) Q = m-LOC[Q] RC = m-LOC[Q] (s) . 

Many other specializations of the Galois connection Pol — Inv were 
investigated, in particular, special Galois closures were characterized 
for various purposes. We collect some of these specializations in Ta- 
ble 1 below where we briefly mention which Galois closures have been 
characterized and give some references. The first two columns show 
how to choose the sets E and R in order to apply Theorem 3.2 (how- 
ever, although our approach via relational clones can be applied to 
every particular case there are often other characterizations which are 
more practicable; some details can be found e.g. in [P5s80a] and in 
the references of Table 1). 

In order to read Table 1 correctly we mention that sometimes the 
reader sees a set F of operations where a set of relations should be 
expected. Here we use the convention that an n-ary function / : A n — >■ 
A also may be treated as (n + l)-ary relation, namely the graph of f 
which we denote by 

f • {(®1; ■ ■ ■ i Q>ni f (®1» • • • i ®n)) | • • • j ^ A} . 

Thus, e.g., the so-called bicentralizer Pol Pol F of F means Pol(PolF')* 
(with F* := {/* | / G F} for a set F of operations). Note that e.g. 




242 



R. Poschel 



an automorphism / of an algebra A = (A; F) may be characterized in 
different ways: 

f £ Aut A 44- / E Aut F* /* £ Inv F . 

Once the Galois closures are characterized the way is open for applica- 
tions. As an example we take the so-called concrete characterization 
of related structures like the subalgebra lattice Sub A, the automor- 
phism group Aut A or the congruence lattice Con A of an algebra 
A = (A; F ). The problem may be posed as follows: 

The concrete simultaneous characterization problem: Given a set L C 
f}3(A) of subsets of A, a set C C Eq(A) of equivalence relations on 
A and a set G C Sym(A) of permutations on A. Does there exist 
an algebra A such that G is the automorphism group of A, L is the 
subalgebra lattice of A, and C is the congruence lattice of A ? 

Note that we do require exact equality (e.g. C = Con A.) and not 
equality up to isomorphism (the latter is the abstract characteriza- 
tion problem which sometimes is easier to answer and gives satisfac- 
tory globally valid results; here are some references for the abstract 
characterization problem: [Bir46], [BirF 48] , [GraS63], [Sch63], [Sch64], 
[GraL67], [Jon74], [SchT79]). 

The answer to the concrete characterization problem in principle is 
already provided with Theorem 3.2, but we shall make it more explicit: 

Theorem 3.5 (cf. [Pos80a, (7.2)]). For a given permutation group 
G < Sym(A) and given L C fp(A), C C Eq(A), there exists an algebra 
A — (A; F) such that G = Aut A, L = Sub A and C = Con A if and 
only if 



G' = Sym(A)* n [Q] RC 
L = <p(A) n [Q] rc 
C = Eq(A) n [Q] rc 

where Q := G* U L U C. The algebra A can be chosen with at most 
m-ary operations if and only */[Q]rc is replaced by 77?.-LOC[Q]rc. 

Proof. Note that / £ Aut A /* £ Sym(A)* D InvF, U £ Sub A 
& U £ Inv (1) F = SubF and 9 £ Con A 9 £ Eq(A) D InvF. 
Thus the theorem immediately follows from Theorem 3.2. (For more 
details we refer to [Pos80a]). □ 
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In Table 1 we also mention several further results for the concrete 
characterization problem. 

Tab. 1: Several closures of the Galois connections B Pol — ^Inv 
(E C OpQ4), R C Rel(A)) 



E 


R 


Galois closure 


References 


Op(A) 


Rel(A) 


Pol Inv F 


[BodKKR69a], 
[BodKKR69b], 
[Gei68], [BakP75], 

[Rom76], [Rom77a], 
[Rom77b] , [PosK79] , 
cf. 2.3, 2.5 






Inv Pol Q 


[Gei68], 
[BodKKR69a], 
[BodKKR69b], 
[Sza78], [PosK79], 

[Pos79], [Pos80a], 

cf. 2.3, 2.5 


Generalization to infinitary relations 


or operations 






Pol Inv°° F 


[Ros72], [KraP76], 
[Poi81] 






Inv°° PolQ 


[Ros79] 






Pol 00 Inv°° F "i 
Inv°° Pol 00 Q J 


[Kra76b], [KraP76], 
[Poi81] 


arity restrictions 


Op (A) 


Rel^{A) 


PolInv^ m ^ F 
Inv (m) PolQ 


[Gei68], [BakP75], 

[Pos80a] 

[Ros78] 


Op(T) 


Rel( 1} (T) 


Pol Sub F 
Sub PolQ 


[Sch82, Thm. 1.6], 

[Pos80a] 

see below 


OpOb(T) 


Rel(A) 


InvPol^Q 


[Sza78], [Pos80a] 






to be continued at next page 
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Table 1, 


continued from previous page 






E 


R Galois closure 


Reference; 


s 


Tr(A) 


Rel(A) Inv End Q "i 


[Kra38], 


[Kra50], 



End Inv F / [Kra76a] , 

[Kra86], [Gou68], 



[BodKKR69a], 
[BodKKR69b], 
[Pos80a], [BorOO] 



Sym(A) 


Rel(A) 


Inv Aut Q y 


[Kra38], 






(slnv Aut Q) 


[BodKKR69a], 






wAut Inv F 


[BodKKR69b], 






Aut slnv F J 


[Gou72a], [Pos80a], 
[BorOO] 


Sym(j4) 


Rel ( m \A) 


AutW m >F 


[Wie69] 




restriction to (graphs of) operations only 


Op(A) 


Op(AY 


Pol Pol F 


[Sza78, Thm. 13], 
[Faj77], [Dan77](for 
\A\ =3), (also Kuz- 
necov, cf. [Val76]) 


Op(A) 


Tr(A) # 


Pol End F 


[SauS82], ([Rei82] 

implicit operations) 






End Pol Q 


see below 




concrete characterization problems 






concrete characterization of Aut A 


Op(A) 


Sym(A)* 


Aut Pol Q 


[Jon68] (cf. [Jon72, 
(2.4.3)]), [Kra50], 

[ArmS64], [Sza75], 
[Bre76] 






concrete characterization of Aut A 






for algebras A 
operations 


with at most m-ary 


Op( m ) (A) 


Sym(A)* 


Aut Pol 


[Plo68], [Gou72a], 

[Jon72, (2.4.1)] 






concrete characterization o/Sub^4 


Op (A) 


V(A) 


Sub Pol Q 


[BirF48] (cf. 

[Jon72, (3.6.4)]), 

[Gou68], [Gou72b] 






to be continued at next page 
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Table 1, co 
E 


ntinued from previo' 
R 


is page 

Galois closure References 






for unary algebras: 
[Jon72, (3.6.7)], 

[JohS67] 


Op (A) 


Eq(A) 


concrete characterization of Con A 
Con Pol Q [Arm70] (partial 

solution), [Jon72, 
(4.4.1)], [QuaW71], 
[Wer74] , [Dra74] 

Pol Con F for p-rings (A; F) 

[Isk72] 


Op (A) 


TV (A)* 


concrete characterization of End A 
End Pol Q [Lam68], [GraL68], 

[SauS77a], [Sto69], 
[Sto75], [Jez72], 

[Sza78, Thm. 15] 


Op(A) 


Sym(A)* U ip (A) 
Sym(A)* U Eq(A) 

Op(A)* U Sub(A) 

Op(A)*USub(A) 
U Eq(A) 


concrete characterization of 
Autv4&Sub*4 [Sto72], [Gou72b], 

cf. 3.5 

A ut A & Con„4 [Wer74] (conjecture) 
cf. [Pos80b] , (for 
simple A [Sch64]), 
cf. 3.5 

Endyl &Sub A [SauS77b] (cf. 

[Jon74]) 

Aut A h Sub A & Con A 

[Sza78], [Pos80a], 

cf. 3.5 


Generalization to partial functions and multifunctions see 4.1 



4 Further Galois connections 

Now let us consider further Galois connections from the points of view 
(B) and (C) mentioned at the end of Section 2. 

Concerning (B) we mention here some generalizations of the opera- 
tions under consideration while the other side of the Galois connection 
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- the relations Rel(A) - remains unchanged. The invariance relation 
which induces the Galois connection is a more or less straightforward 
generalization and we refer to the references. 

4.1. Galois connections with generalized operations. 

• partial functions (i.e. mappings / : B — >• A with B C A n ): 
[Ros83] ([FleR78]), [HadR92] (cf. also [Fre66]), 

[Rom81], [B5rHP91], [SusK94], 

• multifunctions (i.e. mappings / : A n — > ^J(A)): [BorOl], [DreP], 

• heterogeneous functions (i.e. mappings / : A^ x A in — >■ A io for 
a family (Ai) ieI of base sets, i 0 , ii, . . . , i n E I): [Pos73]. 

Another generalization can be found in [PosROO] where cofunctions 
(i.e. mappings / : A — > {1, . . . , n} x A) and corelations (i.e. subsets 
g C {1 ,...,n} A ) are considered instead of functions and relations. 
A Galois connection which generalizes many features of functions, 
cofunctions, partial functions and multifunctions was investigated in 
[RofiOO]. 

Finally we demonstrate (C) point of view. 

4.2. Closures of relational systems as Galois closures. 

In Table 2 below we show how several closures of systems of rela- 
tions can be characterized by Galois connections. These closures can 
be classified by logical operations (cf. 1.6) of particular sets of first 
order formulas (cases (l)-(8)) or other operators on relations (cases 
(9)-(15)). 

However the investigations were not really motivated by this classifica- 
tion. In cases (l)-(8) the algebraic structure of the relational systems 
under consideration was of most important interest and gave such a 
hierarchy of relational systems. In cases (9)- (15) also the closed sets 
of operations were the starting point of interest. 

We mainly report on results from [BorPS] (cases (l)-(8)) and [Pos84] 
(cases (9)-(14)) and refer to these papers for further details. Some of 
the cases are already discussed in this paper and were mentioned in 
Table 1; however they fit into the scheme used here and therefore they 
are listed once more. 

Table 2 is organized as follows. The first column shows the closure by 
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indicating under which operations the relational system is to be closed. 
The next column gives a notation for the closure (if it was introduced 
somewhere). The last two columns indicate the Galois connection and 
briefly describe the Galois closed sets (the last column hereby shows 
what is the operational counterpart for the Galois connection under 
consideration). For simplicity we assume that the base set A is finite 
for the cases (l)-(8). The generalization to arbitrary base sets A for 
(1),(2),(3) is given in (9), (14), (15). In cases (6)-(9) one has essentially 
to consider fl-complete relational systems and locally closed opera- 
tional system in order to get the results also for infinite A (for details 
see [BorPS]). 

We shall provide here explicitly only those notions in Table 2 which 
concern the relational systems (for the operational part see the above 
mentioned papers). 

A Boolean system , briefly BS, is a set Q C Rel(A) of relations which 
is closed with respect to the Boolean operations (union U, intersection 
f], complement cf. also 1.5), contains as constants the sets 0 and 
A m (m G N + ) and is closed with respect to the following operation 
W s : Rel^(A) — » Rel^^A) for all mappings s : n — > m (n, m G N + ): 

W s (g) := {(ai, . . . ,a m ) G A m \ (a s( i), . . . ,a s(n) ) G p} for g G Rel {n) (A) . 

A Boolean system with identity, briefly BSI, is a Boolean system which 
contains the diagonal relation (cf. 1.5). 

A Boolean system with projections, briefly BSP, is a Boolean system 
which is additionally closed under the projections pr 7 (cf. 1.5). 

Tab. 2: Relational closures as Galois closures 





Closure 


Nota- 

tion 


Galois connection 


closed rela- 

tional system 


closed opera- 
tional system 


for finite base set A: 


(i) 


Lop A (3,A, V,i,=) 


[Q]ka 


slnv — Aut (cf. 2,3 and Tab. 1) 


Krasner alge- 
bra (cf. 1.5, 
1.6) 


group of per- 
mutations 



to be continued on next page 
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Closure 




Galois connection 


closed rela- 

tional system 


closed opera- 
tional system 


Table 2 , continued from previous page 


(2) 


Lop A (3, A, V,=) 


[Q]wka 


Inv — End (cf. 2.3 and Tab. 1) 


weak Kras- 

ner algebra 

(cf. 1.5, 1.6) 


monoid of 

unary func- 

tions 


(3) 


Lop^a, A,=) 


[Q]ra 


Inv — Pol (cf. 2.3 and Tab. 1) 


relational alge- 
bra (cf. 1.5, 

1.6) 


clone of fini- 
tary functions 


(4) 


Lop^(A, =) 




Inv — pPol 


weak system 
with identity 


down-closed 
clone of fini- 
tary partial 

functions 


(5) 


Lopyi(A) 




Inv — mPol 


weak system of 
relations 


down-closed 
clone of 

Unitary multi- 
functions 


(6) 


Lo Pa(3, A, V, -i) 


[Q]bsp 


slnv — sEnd (sbmEnd, resp.) 


BSP (cf. 4.2) 


Special mo- 

noid of unary 
functions (cf. 
[BorPS, 7.9]) 


(6’) 


(down-closed 
involuted mo- 
noid of bitotal 
multifunc- 
tions, resp.) 



to be continued on next page 
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Closure 




Galois connection 


closed rela- 

tional system 


closed opera- 
tional system 


Table 2 , continued from previous page 


(7) 


L°Pa(A, v, =) 


[Q]bsi 


slnv — spmEnd 


BSI (cf. 4.2) 


down-closed 
involuted 
monoid of 

pp-multifunc- 
tions (partial 
permutations) 


(8) 


Lop A (A, V, --) 


[Q]bs 


slnv — smEnd 


BS (cf. 4.2) 


down-closed 
involuted mo- 
noid of unary 
multifunctions 


for arbitrary base set A: 


(9) 


(KC) = 

(WKC)&(-.)&(sS) 


[Q]kc 


slnv — Aut (cf. 2.5 and Tab. 1) 


Krasner clone 
(cf. 2.4) 


locally closed 
group of per- 
mutations 


(10) 


(PKC) = 

(WKC)&(^)&(sS) 


[Q]pkc 


Inv — wAut (cf. 2.5 and Tab. 1) 


Pre-Krasner 
clone (cf. 2.4) 


locally closed 
monoid of per- 
mutations 


(11) 


(WKC)&(sS) 




Inv — sur-End 


Inv H for sets 
H of surjec- 
tive unary 

functions 


locally closed 
monoid of sur- 
jective unary 
functions 


(12) 


(WKC)&(-i) 




Inv — inb-End 


Inv H for lo- 
cally invertible 
monoids H 


locally closed 
locally invert- 
ible monoid 

of unary 

functions 



to be continued on next page 
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Closure 




Galois connection 


closed rela- 

tional system 


closed opera- 
tional system 


Table 2 , continued from previous page 


(13) 


(WKC)&(0 




Inv — inj-End 


Inv H for 

monoids H 

of injective 

functions 


locally closed 
monoid of 

injective unary 
functions 


(14) 


(WKC) = 
(RC)&(l-LOC) 


[Q]wkc 


Inv — End (cf. 2.5 and Tab. 1) 


weak Krasner 
clone (cf. 2.4) 


locally closed 
monoid of 

unary func- 

tions 


(15) 


(RC) = 
(S)&(LOC) 


[Q]rc 


Inv — Pol (cf. 2.5 and Tab. 1) 


relational 
clone (cf. 2.4) 


locally closed 
clone of Uni- 
tary functions 



There are many interdependences between the operations on relations 
mentioned in this paper. E.g., with intersections and the operations 
W s (cf. 4.2) one can express the operations (3) and (5) from 1.5. Of- 
ten it depends on the cardinality of the base set (finite, countable, un- 
countable) which operations on relations are needed for the characteri- 
zation of the Galois closed sets (detailed investigations can be found in 
[BdrOO]) and how they are interrelated (e.g. (KC) <=> (RC)&(-i) holds 
for finite and countable but not for uncountable A, [DroKMPOl]). 

Finally we mention that there is an interesting interpretation of the 
operations of relational algebras (and Krasner algebras) as operations 
of the Peircean algebraic logic (PAL) proposed also by R.W. Burch 
in [Bur91] and modelled mathematically in power context families, 
see [Pol02] and [ArnOl], for more details we refer to [HerP03]. PAL is 
closely related to the existential graphs that Peirce developed in the 
late 1890s, and to the conceptual graphs (cf. e.g. [Sow92]). 
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Galois Connections and Polynomial 
Completeness 

K. Kaarli* 



Abstract 

The aim of this paper is to emphasize the importance of the Galois theo- 
retic approach in the study of polynomial completeness problems. We show 
that reasoning in terms of Galois connections can be used for describing 
(local) polynomial functions, for classifying and introducing new types of 
polynomial completeness properties, etc. Special attention is paid to al- 
gebras with a majority term, in which case the local term functions of an 
algebra A are determined by the subuniverses of A 2 . 

AMS Mathematics Subject Classification: O 8 A 40 , 06A15. 

Key words: Galois correspondence , Polynomial completeness , Primality 7 
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1 Introduction 

In the earlier literature polynomially complete was a synonym of func- 
tionally complete. Here we use the notation and terminology of the 
monograph [10] where polynomial completeness is a property of alge- 
bras (or classes of algebras) whose characteristic feature is richness in 
polynomial functions, in some sense. So there are many polynomial 
completenesses, and functional completeness is just one of them. Re- 
call that a finite algebra A is called functionally complete if all Unitary 
functions defined on its universe A are polynomials. However, there 
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are many other completenesses, and a typical one is affine complete- 
ness: an algebra A is affine complete if its polynomial functions are 
exactly the congruence preserving functions. Since the term functions 
are special polynomial functions, we consider the property of primal- 
ity and its variations also as special kinds of polynomial completeness. 
Recall that a finite algebra A is called primal if all finitary functions 
defined on its universe A are term functions. One of several varia- 
tions of primality is endoprimality: an algebra A is endoprimal if its 
term functions are exactly the finitary functions defined on A which 
permute with all endomorphisms of A. 

The examples of polynomial completenesses given so far are strictly 
universal algebraic because they make sense for any class of (universal) 
algebras. If we restrict ourselves to a certain special class of algebras 
then it may happen that all polynomial (term) functions of algebras 
of that class preserve some specific relation which can be defined only 
for algebras of that class. In this situation it is reasonable to consider 
such algebras for which the polynomial (term) functions are exactly 
the functions on their universe which preserve that specific relation. A 
typical example is lattices and the order relation, and the correspond- 
ing polynomial completeness property is order functional complete- 
ness, also known as order polynomial completeness. The combination 
of order functional completeness and affine completeness is order affine 
completeness : a lattice is order affine complete if its polynomial func- 
tions are exactly the functions on their universe which preserve the 
congruences and the order relation. 

It is also possible (and useful) to consider instead of polynomial or 
term functions local polynomial or local term functions, respectively. 
For example an algebra (not necessarily finite) is locally functionally 
complete if all functions defined on its universe are local polynomials , 
that is, can be interpolated by a polynomial at any finite subset of 
their domain. It will turn out that the approach based on Galois con- 
nections is especially useful in the case of local versions of polynomial 
completenesses. 

2 The basic Galois connection 

Let A be an arbitrary nonempty set. We denote by T{A) the set of 
all finitary functions on A and by 7Z{A) the set of all finitary relations 
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on A. We also use symbols F n (A) and 7 Z n (A) for denoting the sets 
of all n-ary functions and relations, respectively, on A. Thus, every 
f £ F n (A) is a function from A n to A and every r £ 7Z n (A) is a 
subset of A n . Note that we do not require that a function / £ T n (A) 
be essentially n-ary. If F is any subset of F(A) then its n-ary part is 
F n = F fl F n (A). The n-ary part of R C 7Z(A) is defined similarly. 
Our Galois connection is generated by the fundamental binary relation 
“preserves” between F(A) and 7Z(A) which we describe now. Let 
/ £ F n (A) and r £ 7 Z m (A). We say that / preserves r if for every 
m x n matrix (a^) with all its rows in r, we have: 

(/ (®llj ■ • • ; ®1 n) i ■■■if (®ml? • • • ) ®mn)) G T ■ 

Instead of saying “/ preserves r” one often says: “r is /-invariant” or 
“/ respects r” or “r supports /” or (especially if r is an equivalence 
relation) “/ is r-compatible” . 

The context {fF(A), 7Z(A), preserves) generates the Galois connection 
between the ordered sets (p(F(A)), C) and {p{7Z{A)), C) in the usual 
way (say, as in formal concept analysis). This connection is established 
by the mappings: 

V : p(K(A)) -» p(T(A)), 

R M- R v — {/ £ F(A) | / preserves all r £ i?} 



and 



e ■ p(HA)) ^ p(R(A)), 

F !->■ F e = {r £ 71(A) | r is preserved by all / £ F} . 

This basic Galois connection can be extended to partial functions but 
also restricted to functions and/or relations of restricted arities, to 
equivalence relations, etc. 

The central question related to any Galois connection is to describe the 
Galois-closed members of the two given ordered sets. In the present 
situation this means to describe: a) the sets of relations having the 
form F e where F C F(A) and b) the sets of functions having the form 
R? where R, C 71(A). The answers to these questions are well known, 
especially in the case of a finite set A. In order to formulate them, 
we introduce the notion of relational clone, also known as relation al- 
gebra or coclone. In fact there is no generally accepted definition of 




262 



K. Kaarli 



the relational clone. Different authors use different basic operations. 
Perhaps the most economical definition is one due to R. Poschel [13] 
(see also R. Poschel’s contribution “Galois Connections for Operations 
and Relations” in this volume): the relational clone on A is a subset 
R C R{A) which contains all diagonals and is closed with respect to 
the so-called general superposition operation. The latter actually cov- 
ers many different simpler operations which are obtained by choosing 
appropriate functions the general superposition operation depends on. 
We prefer to give a definition containing more but simpler operations 
rather than a single complicated operation. 

Definition 2.1. A subset R ofTZ(A) is said to be a relational clone, 
if it contains the binary diagonal A^ = {(a, a) | a G A} of A and is 
closed with respect to the following operations: 

1. renumbering of coordinates, that is, assigning to r G 7 Z n (A) the 
relation r a = {( 2 ^( 1 ), . . . , x^ n f) \ (x \, . . . , x n ) G r} where a is a 
permutation on the set {1, . . . , n}; 

2. adding a coordinate, that is, assigning to r G R n {A) the relation 
r x A <E n n+l {A); 

3. projection, that is, assigning to r G R n +i the relation 

{(xi, . . . , x n ) | (3y G A) (xi,..., x n , y) G r}; 

4 . intersection, that is, assigning to r,s G 7Z n (A) their intersection 
r n s G R n (A). 

It easily follows from the definitions that every relational clone on A 
contains the diagonals A^ = {(a, . . . , a) G A n \ a G A} for every n, 
and their direct products. The set of all such “generalized” diagonals 
is the smallest relational clone on A. We denote it by T>(A). It is not 
difficult to see that the binary part of every relational clone is closed 
with respect to the usual relation product. 

Theorem 2.2. (see [14], [16]) Let A be a finite set. Then 

1. a subset F C IF (A) is Galois-closed iff it is a clone on A, that 
is, F contains all projection maps and is dosed with respect to 
superposition; 
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2. a subset R C 11(A) is Galois-closed, iff it is a relational clone on 
A. 

In case of an infinite set A it is impossible to characterize Galois-closed 
sets of functions and relations in purely algebraic terms; it is necessary 
to impose additional conditions of a topological nature. Given a subset 
F C T(A) we define its closure F as follows: a function / e F n (A) 
belongs to F iff it can be interpolated by members of F on every 
finite subset of A n . The clone F C F(A) is called a local clone on 
A if it coincides with F. We also have local relational clones ; all we 
need to define them is to add to the definition of relational clone the 
requirement of closure under arbitrary intersections. 

Theorem 2.3. ([13]) Let A be an arbitrary set. A subset F C F(A) 
is Galois-closed iff it is a local clone on A. A subset R C 1Z(A) is 
Galois-closed iff it is a local relational clone on A. 



3 Primality and its generalizations 

Let A = (A; F) be an algebra with universe A and set of fundamental 
operations F. Then F e is the collection of all subuniverses of A", 
n = 1,2, , that is, 

F g = |J{SubA n |n = 1,2,...}. 

It is well known that a function on A is a local term function of A iff 
it preserves all subuniverses of all finite direct powers of A. Hence, 

F eip = LocTerm A 

is the clone of all local term functions of A. An algebra A is called 
locally primal if LocTerm A = F(A), that is, if all functions on A are 
local term functions of A. Since F(A) e = 2? (A), we conclude that an 
algebra A is locally primal iff its finite direct powers have only trivial 
subuniverses. Here we call a subuniverse R of A” trivial if it is a direct 
product of diagonals A£, m < n. Since the congruence relations and 
the graphs of endomorphisms of A are subuniverses of A 2 , the locally 
primal algebras must be simple and have as their only endomorphism 
the identity map. Consequently, if we wish to generalize the notion of 
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local primality to nonsimple algebras or to algebras having nontrivial 
endomorphisms, we must require that the local term functions are 
exactly the functions which are compatible with all congruences or 
the functions which permute with all endomorphisms. Using our basic 
Galois connection the corresponding notions (and some other related 
ones) can be defined as follows. 

Definition 3.1. An algebra A is called: 

• locally congruence primal if (Con A)^ = LocTerm A; 

• locally endomorphism primal if (End A) v = LocTerm A; 

• locally automorphism primal if (Aut A) v = LocTerm A; 

• locally quasiprimal if (Partlso A)^ = LocTerm A. 

The corresponding non-local notions are defined by replacing the op- 
erator LocTerm with Term in the above definition. The reader may 
wonder why our preference is for the local versions of primalities. The 
answer is the following. In the literature the primal algebras are usu- 
ally finite by definition. Then the primality of A = (A; F) is equiva- 
lent to F e = T>(A). In the case of an infinite A the latter property, 
however, is equivalent to local primality, not to primality. So, in view 
of our basic Galois connection between Unitary relations and finitary 
functions local primality is a natural generalization of primality to 
infinite algebras. 

In the older literature subalgebra primal and congruence primal alge- 
bras were called semiprimal and hemiprimal, respectively. The term 
endoprimal is usually used instead of endomorphism primal. The sym- 
bol Partlso A denotes the local clone of partial isomorphisms of A, 
that is, the collection of all isomorphisms between subalgebras of A. 
Quasiprimality has turned out to be the most successful generalization 
of primality. Note that we have not presented here the complete list 
of primalities one can find in the literature. It should be mentioned 
that some of them, for example paraprimality, cannot be defined via 
our basic Galois connection. 

The usual way to show that an algebra A is, say, locally congruence 
primal is to take a function / e (ConA)^ and to prove that / is a 
local term function. An alternative way to do this is suggested by the 
fundamental theorem on Galois connections. 
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Theorem 3.2. An algebra A is locally congruence primal ConA 
generates the local relational clone |J{Sub A" | n = 1, 2, . . . }. Similar 
statements hold for the other local primalities listed in Definition 3.1. 

Proof We give the proof only for congruence primality. If A is locally 
congruence primal then (Con A) v = LocTerm A, implying that 

(ConA) w = (J{Sub A n \ n= 1,2,...}. 

This proves the necessity of our condition. The sufficiency follows by 
applying the operator ip to the equality (1). 

We shall denote by A + the algebra obtained from A by adding to 
F all elements of A as nullary operations on A. Formally we write 
A + = {A; F U A). Now the subuniverses of (A + ) n are precisely the 
subuniverses of A n which contain the diagonal A^. These are called 
diagonal subuniverses of AT Hence we may write 

(F u A)‘ = lJ{Sub d ia 6 A" | n = 1,2, . . . } 

= (J{Sub(A+)” | n = 1.2, 

The (local) term functions of A + are the (local) polynomial functions 
of A. Thus 

(F U A) 0tp = LocTerm A + = LocPol A . ( 1 ) 

An algebra A is called locally functionally complete if LocPol A = 
F{A), that is, if all functions on A are local polynomials of A. It 
follows from (1) that A is locally functionally complete iff A + is locally 
primal iff (F U A) e = D{A). 

Since polynomial functions do not preserve all subuniverses and do 
not permute with all (partial) endomorphisms, there are not so many 
generalizations of local functional completeness. In fact there is only 
one such generalization which makes sense for all varieties of algebras. 
It is based on the fact that congruences are preserved by polynomial 
functions. 

Definition 3.3. An algebra A is called locally affine complete if Con A 
= LocPol A. 

Analogously with Theorem 3.2 we have the following theorem. 

Theorem 3.4. An algebra A is locally affine complete i/f Con A gen- 
erates the local relational clone (J{Sub (A + ) n \ n = 1,2,...}. 
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4 Varieties with near-unanimity term 

Theorem 2.2 shows, in particular, that the notion of relational clone 
is an adequate tool for the formalization of the collection of all sub- 
universes of finite powers of a finite algebra. Due to Theorem 2.3, for 
arbitrary algebras a similar role is played by local relational clones. It 
is natural to ask if it is possible to formalize the collection of subuni- 
verses of A n , for a given n. This would correspond to the restriction 
of the basic Galois connection to the set of n-ary relations. An answer 
can be derived from general results of [13]. This answer, however, 
seems to be too complicated, in general. Here we present the solution 
recently obtained by J. W. Snow for the important special case where 
the clone of functions contains a near-unanimity function. 

Recall that a function / G T n (A), n > 3, is called a near-unanimity 
function if the equality /(eq, . . . , a n ) = a holds whenever all but per- 
haps only one of the elements oi, . . . , a n G A are equal to a. Let V be 
a variety of algebras of type r. We say that a term t of type r is a 
near-unanimity term for V if it induces a near-unanimity function on 
every A G V . 

A ternary near-unanimity term is called a majority term. If m is a 
majority term for a variety V then the identities 

m(x, x, y) ~ m(x, y, x) ~ m(y, x, x) & x 

hold in V . Every variety with lattice reduct has a majority term 
(x Ay) V (y A z) V (z Ax). 

Varieties with near-unanimity term have many interesting features 
(see [1] and [10]). For us the most important is the following. 

Theorem 4.1. Let V be a variety with an (n + l)-ary near-unanimity 
term and let A G V. Then (Sub A n ) v = LocTerm A. 

Hence, in the case of a variety with an (n + l)-ary near-unanimity 
term, if we want to check whether a given function on an algebra 
A is a local term function, it suffices to test it only on subuniverses 
of A n . This made it even more important to try to characterize in 
more or less feasible terms of A n the systems of subsets of A n which 
can appear as systems of subuniverses of some algebra A = (A; F) 
where F contains an (n + l)-ary near-unanimity function. For the 
case n = 2 and A finite this problem was posed by C. Bergman [2], 
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Since the collection of subuniverses of A 2 is closed with respect to the 
operations of taking converse “ intersection Pi and relation product o, 
C. Bergman asked: is it true that for every R C 72-2 (A) containing 
A 2 and A 2 , and closed with respect to the operations “ , fl and o, 
there exists a set F C T(A) containing a majority function and such 
that F e fl 72.2 (-4) = 77? The answer to a more general question was 
given by J. W. Snow in his Ph.D. thesis [16]. In order to formulate his 
result we define some operations on the set 7 2. n (A). First, for every 
mapping a : {1 , . . . , n} — > {1, . . . , n} let P a : A n — 1 A n be defined by 
the formula: 

Paifili • • • , Q>n) (®<t( 1) j • ■ ■ j • 

Then both P a and P~ l (taking of full preimage) can be considered 
as unary operations on 7 Z n (A). Second, we define an n-ary opera- 
tion c n which generalizes the binary relation product operation: given 
ri, . . . , r n G 7 Z n (A) the relation c n (r\, ... , r n ) is the set of all n-tuples 
(ax, , a n ) G A n for which there exists a n+ 1 G A such that 

(®1> ■ ■ ■ ) 1) ®n+l) G ^1) 

(®1> • • • j (^n—2i Q 71 ) ®ra+l) ^ • • • > (® 2i • • • i Qn+l) £ fn ■ 

Theorem 4.2. ([16], Theorem 2.10) Let A be a finite set. Assume that 
a subset R C lZ n (A) contains and A n , and is closed with respect to 
the operations fl, c n , P a and Pfi 1 for every mapping a. If RP contains 
an (n + 1 )-ary near- unanimity function, then R ve fl 7 Z n (A) = R. 

In case n = 2 Theorem 4.2 answers Bergman’s question. For any set A 
we consider the algebra F? 2 (A) = ( 72 . 2 (A); 0, 1,", D, 0 } where the nullary 
operation symbols 0 and 1 are interpreted as A^ and A 2 , respectively. 
It is easy to see that in case n = 2 all operations P a and Pfi 1 can be 
expressed via the operations of R 2 (A). Consequently Theorem 4.2 has 
the following corollary. 

Corollary 4.3. Suppose that A is a finite set and R G 72-2 {A). If RP 
contains a majority function, then R vs fl 72.2(A) is the subuniverse of 
R 2 (A) generated by R. 

In the sequel we indicate some applications of the latter result. It 
seems very likely that these results of Snow have infinite analogs. We 
conjecture that Corollary 4.3 remains valid in the case of an arbitrary 
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set A if we only require that R is closed with respect to arbitrary 
intersections. 

For later use we introduce the following notation. For an algebra A, 
the subalgebra of R 2 (A) consisting of all diagonal subuniverses of A 2 
will be denoted by S 2 (A). 

5 Order affine complete lattices 

A lattice L is called order functionally complete if {<} v = PolL and 
order affine complete if (ConL U {<}) </5 = PolL. One gets the local 
version of this notion by replacing the operator Pol with LocPol in the 
above definition. The study of order functionally complete lattices was 
initiated by D. Schweigert [15]. Later finite order functionally com- 
plete and order affine complete lattices were characterized by R. Wille 
([17], [18]), and M. Kindermann showed that tolerances and the order 
relation of a finite lattice determine its polynomial functions and that 
a finite lattice L is order functionally complete iff TolL = {A%,L 2 } 
([11]). Recall that a tolerance of an algebra A is a reflexive (in other 
words, diagonal) and symmetric subuniverse of A 2 . 

Kindermann’s original proof implicitly uses a Galois theoretic ap- 
proach. He shows that every diagonal subuniverse of L 2 , where L 
is a finite lattice, can be obtained from tolerances and the order rela- 
tions < and > of L using two binary operations: D and +, the latter 
denoting formation of the sublattice generated by two given sublat- 
tices. In view of Snow’s result (Corollary 4.3), the algebra S 2 (L) is 
generated by tolerances and the order relation < of L, for any finite 
lattice L. 

In [10] we extended the basic results about finite order functionally 
complete and order affine complete lattices to the infinite case. In 
particular, we proved the following analogs of Kindermann’s results. 
It is possible that the results were known earlier but we are not aware 
of the corresponding references. 

Theorem 5.1. ([10], Lemma 5.3.18) For an arbitrary lattice L, 
(TolL U {<}) v = LocPol L . 

Theorem 5.2. ([10], Theorem 5.3.40) A lattice L is locally order func- 
tionally complete iff A% and L 2 are its only tolerances. 
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Obviously the sufficiency part of Theorem 5.2 follows from Theo- 
rem 5.1. Indeed, if L has only trivial tolerances, then 

LocPolL = (TolL U {<})^ = {<}* , 

saying that the local polynomial functions of L are precisely its order 
preserving functions. 

In the finite case the necessity part of Theorem 5.2 can be easily 
derived from Theorem 5.1 and Corollary 4.3. Suppose that L is a 
finite order functionally complete lattice. Then {<1^ = PolL which 
implies 



{<}^n n 2 (L) = (PoiL) e n^ 2 (L) = Sub diag L 2 . 

Now Corollary 4.3 implies that the order relation < generates the 
algebra S 2 (L). However, it is easy to see that the subalgebra of R 2 (T) 
generated by < has the universe {<, >, A 2 ,L 2 }. In particular, L has 
no nontrivial tolerance relation. 

A similar way of reasoning works also in the case of order affine com- 
pleteness defined by the equality (Con Luj^})^ = Pol L. If we assume 
that L is finite and apply to this equality the mapping g then Corol- 
lary 4.3 implies that L is order affine complete iff the algebra S 2 (L) 
is generated by the congruences and the order relation of L. On the 
other hand it follows from Theorem 5.1 that the algebra S 2 (L) is gen- 
erated by the tolerances and the order relation of L. Consequently we 
have the following result. 

Theorem 5.3. A finite lattice L is order affine complete iff all toler- 
ances of L are contained in the subalgebra of S 2 (L) generated by the 
congruences and the order relation of L. 

This is quite close to R. Wille’s [18] characterization of finite order 
affine complete lattices. Wille’s original result is given in terms of 
decreasing V-endomorphisms but it can be easily translated into the 
language of tolerances. Given two tolerance relations S and T of a 
lattice L, we introduce their symmetrized relation product 

S *T = {{a,c)\(3be L) ( a A c,b) £ S, ( b , a V c) G T} . 

Then Wille’s theorem can be stated as follows. 
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Theorem 5.4. A finite lattice L is order affine complete iff every 
tolerance of L can be represented as a meet of symmetrized relation 
products of congruences of L. 

An infinite version of this theorem is presented in [10]. 

6 Local polynomial functions of algebras with 
distributive lattice reduct 

G. Gratzer ([7]) proved that all Boolean algebras are affine complete 
and then characterized affine complete bounded distributive lattices 
([8]). He proved, in particular, that every bounded distributive lattice 
L is order affine complete. This result was generalized to arbitrary 
distributive lattices by D. Dorninger and D. Eigenthaler ([6]). In the 
language of our Galois connection their result reads as follows. 

Theorem 6.1. Let L be a distributive lattice. Then 

(ConLU {<};r = LocPolL. (2) 

In other words, every distributive lattice is locally order affine com- 
plete. 

This result is based on the fact that every distributive lattice L is 
a subdirect power of the 2-element lattice D 2 and every congruence 
preserving function / E T n {L) can be represented via its “coordinate 
functions” : / = (fi)iei where every fi is an n-ary function on the Pth 
subdirect factor L* = D 2 of L. Clearly / is order preserving iff so are 
all the fi . Now a polynomial p which interpolates / at {a 1 , . . . , a m } C 
L n can be found by constructing, in a uniform way, the polynomials 
Pi E PolD 2 which interpolate fi at {a},..., a? 1 } C {0,1}". Note 
that the existence of the interpolating polynomials fi follows from 
Theorem 5.2. The important point is that these polynomials can be 
constructed in a uniform way. 

Theorem 6.1 stimulated the search for similar characterizations of local 
polynomial functions for other varieties which are close to the variety 
of distributive lattices. We describe here the result of this search in 
the case of Kleene algebras. The work was done jointly by M. Haviar, 
M. Ploscica, and the author of the present paper (see [9] and also [10]). 
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Recall that a Kleene algebra is a bounded distributive lattice equipped 
with an additional unary operation ', so that the following identities 
are satisfied: 

(x A y)' ~ x' V y', (x V y)' fa x' A y', 0 7 ~ 1, V ~ 0, 

x" fa x, (x A x 1 ) V y V y' fa y V y' . 

It is well known that the variety of Kleene algebras is generated by 
the 3 -element Kleene chain K 3 . (Obviously there is only one way to 
define a unary operation ' on the 3 -element lattice so that the result is a 
Kleene algebra.) Therefore every Kleene algebra is a subdirect product 
of copies of K 3 and its 2 -element subalgebra K 2 with universe { 0 , 1 }, 
which in fact is the 2 -element Boolean algebra B 2 . Trying to find 
an analog of Theorem 6.1, we first studied the diagonal subuniverses 
of K 3 2 . It turned out that the algebra S 2 (K 3 ) is generated by the 
so-called uncertainty order 

— = {( 0 ) 0 ), (a, a), ( 1 , 1 ), ( 0 , a), ( 1 , a)}, 

where a is the third element of K 3 . Hence we have the equality 

= PolK 3 . As the next step, we showed that using the subdirect 
decomposition, there is a natural way to extend the uncertainty order 
to an arbitrary Kleene algebra. As in the case of distributive lattices, 
it was possible to construct the interpolating polynomials for subdirect 
factors in a uniform way. The final result was the following. 

Theorem 6.2. Let K be a Kleene algebra. Then 

(Con L U {C})^ = LocPol K . (3) 

Using this description of local polynomial functions it was not difficult 
to characterize locally affine complete Kleene algebras. The necessary 
and sufficient condition we obtained is given in lattice theoretic terms 
but its actual meaning is that all congruence preserving functions must 
also preserve the uncertainty order (formally, that C E (ConL)^). 
Thus the formula (3) gives a hint for how to find a characterization of 
locally affine complete Kleene algebras. 

Kleene algebras form a small subvariety of the variety of Ockham al- 
gebras [3], which can be defined as bounded distributive lattices with 
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an additional unary operation ' satisfying the identities of Kleene alge- 
bras, except the last two. In [10] we found an analog of Theorems 6.1 
and 6.2 for Stone algebras. The latter form a subvariety of the variety 
of Ockham algebras defined by the additional identities 

lAi'fsO, x 1 V x" ~ 1 . 

It turned out that in the case of Stone algebras the role of < in (2) 
and □ in (3) is played by the relation 

<g = {(£, y) | x' = y' and x < y} 

which we called the Glivenko order. Recently my doctoral student 
V. Kuchmei observed that there is a common definition of the un- 
certainty order of Kleene algebras and the Glivenko order of Stone 
algebras. Moreover, he found analogs of Theorem 6.2 for some other 
varieties of Ockham algebras, for example for de Morgan algebras. 
However, he also noticed that a similar characterization of local poly- 
nomial functions cannot be extended to all Ockham algebras. These 
results are not published yet. 

7 Endoprimal algebras 

An algebra A is called endoprimal if (End A) 1 * 3 = Term A. L. Marki 
and R. Poschel ([12]) proved that a distributive lattice is endoprimal iff 
it is not relatively complemented. B. Davey ([4]) observed that endo- 
primality is closely related to the theory of natural dualities. Namely, 
he noticed that every finite algebra which is dualisable by its endomor- 
phism monoid, is endoprimal, too. Using the machinery of the theory 
of natural dualities, he and J. G. Pitkethly ([5]) produced a number of 
endoprimal algebras in various classes of universal algebras. Recently, 
in joint work with H. Priestley on endoprimality of Kleene algebras, 
we discovered an alternative direct approach for describing endoprimal 
algebras. The results are not published yet, so we only briefly outline 
the main ideas of our approach. It is based on the following theorem 
whose proof is not very complicated. 

Theorem 7.1. Assume that the algebra A is subdirect in n? e / Aj and 
that there exists an algebra M such that the following conditions hold: 
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• A i is a subalgebra of M for each i G I; 

• A has a subalgebra S isomorphic to M; 

• there exists pel such that A p — M. 

Then , given any function f G (EndA)^, there exists a function g G 
T n (M) such that f = (fi)tei where every fi is the restriction of g to 
Ai and the equality 

aifiix 1 , ..., x n )) = fj(a(x l ), ..., a(x n )) 
holds for every homomorphism a : Aj — > Aj. 

An immediate corollary of Theorem 7.1 is that if the three conditions 
are satisfied and the algebra M is endoprimal, then so is A. In par- 
ticular, every Boolean algebra B is endoprimal because B contains a 
2-element subalgebra (unless it is trivial) and can be represented as 
a subdirect product of 2-element algebras. However, the 2-element 
Boolean algebra B 2 is primal and then obviously endoprimal. 

In general, the importance of Theorem 7.1 is that it reduces the ques- 
tion of endoprimality of A to a question about functions on a smaller 
algebra M. It is especially useful if we have a majority term and M 
is finite, because then every function on M can be effectively tested 
with respect to being a term function. 

The following theorem is an almost direct consequence of Theorem 7.1. 

Theorem 7.2. Assume that M, A, and A i, i G I, satisfy the con- 
ditions of Theorem 7.1. Also suppose that M is finite and generates 
a variety which has a majority term. Then A is endoprimal if the 
algebra S 2 (M) is generated by the set 

{Hom(Aj, Aj) | i,j G /} U {A i:j \i,j G I, i^ j} . 

Here Hom(Aj, Aj) denotes the set of all graphs of homomorphisms 
from A i to Aj and Aij is the canonical projection of A in Ai x Aj. 
Theorem 7.2 shows, in effect, that if the assumptions of the theorem 
are satisfied then A is endoprimal if it has enough homomorphic im- 
ages. For example, it easily implies the sufficiency part of Marki’s 
and Pdschel’s theorem about endoprimality of distributive lattices. 
Indeed, if a distributive lattice L is not relatively complemented then 
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it has a 3-element homomorphic image. This allows us to embed L 
subdirectly into D 2 r so that one of the 2-fold projections Lij is a 3- 
element lattice. Clearly the 3-element subuniverse of D 2 2 generates 
the algebra S 2 (D 2 ). 

One can use similar arguments for finding sufficient conditions for 
endoprimality in the case of other algebras with majority term, for 
example in the case of Stone and Kleene algebras. It is possible to 
show, using Corollary 4.3 and adding one more natural assumption 
on the subdirect decomposition, that in the finite case the condition 
of Theorem 7.2 is necessary, too. However, so far we have not found 
a good application of this result because all known proofs of non- 
endoprimality are based on explicit construction of a function / e 
(EndA)^ \ Term A which remains valid in case of infinite algebras. 
For example, if L is a relatively complemented distributive lattice 
then we get a suitable function / by defining f(x, y,z) = u iff u is the 
complement of y in the interval [x A y, y V z\. 

Acknowledgment I wish to thank Reinhard Poschel who helped me 
to find necessary references and kindly let me use his unpublished 
manuscript. 
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Abstract 

The aim of this survey is a presentation of two kinds of Galois connections, 
which appear in Universal Algebra but are not well known. They are related 
to the notions of general independence and of weak automorphism. There 
are still interesting open problems from these areas. 
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1 Introduction 

Classical Galois theory for fields establishes the correspondence be- 
tween extensions of fields and field automorphisms. G. Birkhoff, O. 
Ore, J. Riguet (see [1], [38], [43]) and others have investigated suitable 
general correspondences between some ordered sets. Such correspon- 
dences are extensively used in formal concept analysis (see [7]). 
Moreover, there are several Galois connections (Galois corresponden- 
ces) used in Universal Algebra. For instance, the Galois correspon- 
dence between identities and varieties (of the same type) is well- 
known (see [1], 3rd ed., Chapter VI). A more general Galois correspon- 
dence between hyperidentities and hypervarieties (of the same type) 
has recently been studied by many authors (see [4]). On the other 
hand, the Galois connection Inv-Pol, a connection between functions 
(operations) and relations, was discovered by M. Krasner and A.V. 
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Kuzniecov, and intensively explored by many researchers, e.g., by L.A. 
Kaloujnine (Kaluznin), R. Poschel, I. Rosenberg and others (see [2], 
[24], [28], [42], [41], [44], [45], [46], [53]). 

In this paper, we want to describe two other Galois correspondences 
which are less well known, namely the Galois connections related to 
the notions of independence (with respect to a set of mappings) and 
of weak automorphism. 

Some interesting problems from these areas are still open. Among 
others, it would be interesting to know a description of the Galois- 
closed sets with respect to the Galois connection related to the notion 
of weak automorphism. This problem is still open and is equivalent 
to the concrete characterization problem of weak automorphisms (see 
Section 4). In the last section we also propose a general setting for 
the connections studied here. 

Investigations of the general notion of independence and of weak ho- 
momorphisms of general algebras were initiated by E. Marczewski, his 
students and co-workers. 

2 Preliminaries 

Let A be nonempty set and O^(A), R(”) (A) the set of all n-ary 
operations and the set of all n- ary relations of A, respectively (for 

OO 

any positive integer n). Let us put 0(A) = [J 0( n )(A) and R(A) = 

n—1 

oo 

|J R( n )(A). The set of all projections will be denoted by E(A). A 

71—1 

subset C(A) C O(A) is called a clone (in the sense of P. Hall) over 
the set A if it contains the set of all projections and is closed under 
compositions of operations (see [3]). If F C O(A), then (F) denotes 
the clone generated by F (i.e. the smallest clone containing the set F). 
The set of all clones over the set A will be denoted by £(A); this set 
forms a lattice, under inclusion. For a given algebra 21 = (A; F) the set 
of term operations of this algebra is the clone (F) , which is denoted by 
T(2l). Two algebras 21 = (A; F) and 03 = (A; G) are term equivalent if 
they have the same sets of term operations, i.e. (F) = (G) . Similarly, 
for a subset B C A, we denote by (H) a (or just ( B )) the subalgebra 
of the algebra 21 = (A;F) generated by B. 
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The first Galois connection we shall describe uses the concept of in- 
dependence. Let 21 = (A;F) be an algebra with a carrier A and a 
set F of fundamental operations. A nonempty set I C A is called 
M-independent (or independent in the sense of Marczewski; see [35], 
[36] and [9]) if for any finite system oq, ..., a n £ I of different elements 
and for any pair f,g£ (21) of n-ary term operations the equality 
f(ai , ..., a n ) = g(ai , ..., a n ) implies / = g in A. We also assume (unlike 
the assumption in [35]) that the empty set is M-independent. This 
notion is a special case of the notion of independence with respect to 
a set of mappings, to be considered in Section 3. 

Now we introduce the necessary background for the second Galois con- 
nection we shall consider, using weak automorphisms. For an algebra 
21 = (A; F) let r be a mapping from A into A. Consider the following 
two conditions: 

(a) For every f £ X(2l) (say: n-ary) there exists g £ T(2l) such that 
(*) (Voi, • • • , a n £ A) (r (/ (oi, . . . , a n )) = g (r (oq) , . . . , r (a n ))) . 

One can verify that if r is “ onto” , then g is uniquely determined (see 
[19]). 

(b) For every g G T(2l) (say: n-ary) there exists / £ T(2l) such that 
the equality (*) holds. 

One can observe that if r is “ 1-1” , then / is uniquely determined. 
The mapping r is said to be a semi-weak endomorphism (see [11], [31]) 
of the algebra 21 if condition (a) holds. A semi-weak endomorphism 
is a special case of an r-morphism in the sense of A. Goetz [22], and 
is related to the notion of anamorphism in the sense of W. S. Hatcher 
and S Whitney [27]. 

If the mapping r satisfies condition (b), then r is a special case of 
an l-morphism (see [22]) and is related to the notion of catamorphism 
(see [27]). Moreover, if r fulfills both conditions (a) and (b), then r is 
said to be a weak endomorphism of the algebra 21 (see [19]). 

Every permutation (bijection) a of the set A induces a bijection a : 
f g of Q(A), where the operation g is defined by / and a in the 
following way: 



g(xi,...,x n )=a(f)(xi,...,x n ) = a(f(a ^Xn)))- 
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We denote by Sa the set of all permutations (bijections) of A. If, for 
the permutation a E Sa and the algebra 21 = (A;F), the equality 

a{%{ 21)) = X(2l) 

holds, then a is said to be a weak automorphism of the algebra 21. 
It is worth adding, that - in the definition of a weak automorphism 
- it is not enough to assume the inclusion d(X(2l)) C X(2l) (in this 
case er is a semi- weak endomorphism with injective a). We denote by 
Aut (21) and W Aut (21) the groups of all automorphisms and all weak 
automorphisms of the algebra 21, respectively. One can verify that 
Aut (21) is a normal subgroup of the group W Aut (21) (see [48], [39]). 

3 The Galois connection related to Q-independence 

In all further considerations of this section, we fix an algebra 21 = 
(4 F). 

We will denote by M (A) (or by M for short) the set of all mappings 
p : T — >• A from all nonempty subsets T C A to the carrier A, i.e. 
M(A) = {p : p € A T , T C A}, and by H(A) (or just H) the set of all 
mappings p : T — »■ A (for T C A), which possess an extension to a 
homomorphism p : (T) — y A (p\ T = p). 

E. Marczewski ([36], p. 137 and [37]) observed that several notions 
of independence weaker than M-independence fall under a common 
scheme, namely one can express them in the language of mappings 
and their extensions to homomorphisms. He proposed the following 
definition: 

Definition 3.1. A nonempty set T C A is called independent with 
respect to the set Q C M(A), or Q -independent for short, ifQC)A T C 
H(A). 

The system of all Q-independent sets will be denoted by Ind%(Q) ( or 
Ind(Q ) for the fixed algebra 21). A subset which is not Q-independent 
is called Q- dependent. 

A number of versions of independence can be viewed as examples 
of Q-independence. If we put Q = M, we obtain M-independence, 
i.e. independence in the sense of Marczewski. Taking Q = S = 

jp | p G (T) t , T C A} gives the 5-independence introduced by J. 
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Schmidt in [47]. A similar claim holds for S'o-independence, where 
So — {p | T T , T C A}, introduced by S. Swierczkowski in [49]. 
Moreover two other notions of independence were defined and in- 
vestigated in [9]. These are Ax-independence, defined by putting 
Q = Ax = {/|t | / € C A}, and ^-independence, obtained 

by taking Q = R, where R C M is the subset of all injective map- 
pings. 

A mapping p £ A T , where T C A, is called diminishing if for every 
f,g £ and for each a £ T the equality f(a) = g(a) implies 
f(p(a)) = g(p(a)). If Q = G is the set of all diminishing mappings, 
then Q-independence becomes the G-independence introduced by G. 
Gratzer in [25] (and modified in [37]). 

It is worthwhile to remark that Q-independence satisfies the following 
important properties, needed for a Galois connection: 

(1) (VQx, Q 2 C M) [Qi C Q 2 => Ind(Q 2 ) C Ind(Qi)] , 

(2) (VQ C M) [ind(M) C Ind(Q ) C 7nd(i7) = 2 A ] , 

(3) for an arbitrary family Qi C M,i £ I 

f| Ind(Qi) = Ind(\J Q t ), (J Ind(Qi) C Jnd(fl Qi)- 

iei iei iei iei 

These properties suggest that we can consider a special Galois con- 
nection 

M D Q i->- Ind(Q ) C 2 a and 2 A D J >->• Qj C M, where Qj is the 
maximal (with respect to inclusion) set of mappings with Ind(Qj) = 
J. We develop this Galois connection as follows. 

For an algebra 21 we will construct a formal context in the sense of R. 
Wille (see [52] and [7]), in which the objects of the context are subsets 
of the carrier of 21 and the attributes are mappings (see [20]). 

Let 2 a be the power set of A. We define a relation g C 2 A x M in the 
following way: 

For a nonempty subset X £ 2 A and a mapping p £ M one declares 
X Qp& (p$l A x ) V (p £ H). 

We also assume that 



(Vp £ M) [0 qp}. 
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Now, the dually adjoint operators forming our Galois connection are 
defined as follows: 

MDQ^ 1 (Q) = {X C A | (Vp e Q) [X 0 p ]} , 

2 a DJ^ Q(J) = {peM | (VI £ J) [X e p}} . 

Let us define Galois closure operators by the equalities: 
y(J)=J(Q(J)), 

A(Q) = Q(1(Q))- 

In this way, we have constructed the formal context 3 = (2 A , M, g) of 
subsets and mappings. We will call 3 a context of Q-independences. 
For Q C M, we have Ind(Q) = Z(Q). 

One can consider, for an arbitrary set Q C M, a maximal (with respect 
to inclusion) set Q such that Ind(Q ) = Ind(Q ) (see [9]). It was proved 
that: 

(4) for a family J of subsets of the carrier A of 21 there exists a 
set Q of mappings such that 

J = Ind(Q) iff Ind(M) C J. 

It is known (also from [9]) that the set £2(21) of all maximal sets 
of mappings with set-theoretic join and meet operations, and with 
complementation defined by 
(a) Q' = (M\Q) U H, 

is a complete atomic Boolean algebra. 

In a context 3 these families ( Q and J) are determined by the closure 
operators A and V- 



Q = A ( Q ) Q = Q, 

J = v ( J) ^ Ind(M) C J. 

In further considerations, a set Q = A (Q) of mappings is said to be 
closed instead of maximal. Similarly, J = \/(J) is called a closed 
system of subsets of A. 

Both closure operators y an d A are additive, meaning that 

Vdj4) = |J V(A), where A, C 2 A , 

iei iei 
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A(|jQ0 = U A(Qj), where Q { C M. 
iei iei 

For an algebra SI, a set X C A is called C-independent if, for any 
a G X, a (X \ {a}) . The system of all C-independent sets of the 
algebra 31 will be denoted by C-Ind% (or C-Ind if SI is fixed). It is 
easy to see that Ind(M) C C-Ind. Therefore, by (4), there exist a set 
Q C M of mappings such that C-Ind = Ind(Q ) (see [9]). 

Similarly to the definition (a) above for the complementation of closed 
sets of mappings, we define a complementation on closed systems of 
(independent) subsets of A as follows: 

(/?) J' := (P \ J) U v(0) = (P \ J) U Ind(M), 

for any closed set J C 2 A = P. 

If a subset T is not M-independent and is Q-independent for some 
Q C M, then T is (A(Q))'-dependent. 

There is a simple relationship between complementations in lattices 
of closed sets of subsets of A and closed sets of mappings, namely: 

(5) for any system Q C M, we have Z((A(Q))') = (1(Q))'. 

We know (see [9]) that: 

(6) if the set 1(Q) is hereditary 1 , then the set X ((A((5)) / ) is of finite 
character 2 . 

From statement (5), one can easily conclude that the converse of im- 
plication (6) is not true. 

The lattice T(2t) of all v~ c l° se d sets, with set-theoretic join and meet 
operations and complementation defined by (/?) , is dually isomorphic 
to £2(21). 

For any subset T C A, T ^ Ind(M), we have: 

(7) the set A T UH is an atom of £2 (SI) and { T}Ulnd(M ) is an atom 
of 3(21), 

1 A system J of sets is hereditary if from X G J follows that every subset Y of 
X belongs to J. 

2 A system J of sets is of finite character if a set T belongs to J whenever any 
finite subset F of T is in J. 
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( 8 ) the sets (M \ A T ) U H and P\{T} are co-atoms of the lattices 
3 ( 21 ) and £ 3 ( 21 ) respectively. 

Let us recall another notion of independence, the so-called t-indepen- 
dence, introduced by J. Plonka (see [40] and [23]). For an algebra 
21 = (A; F) a set X C A is called t-independent if for any finite system 
of different elements ai, ...,a n G X and for any n-ary term operation 
/ G T^(2t) we have 

f (ai, a„) ± ai. 

We denote by t-Ind<& (or t-Ind if 21 is fixed) the system of all t- 
independent sets of the algebra 21 . 

It is easy to show (see [23]) that: 

(9) Ind(M) C t-Ind C C-Ind ; 

(10) X is t-independent iff / (ai, ..., a n ) ^ X for any non-trivial term 
operation / G < X^"^(2t) and different elements ...,a n G X. 

One can construct (in a rather artificial way, see [9]) a set Q C M of 
mappings such that 

(7) t-Indfn = Indm(Q), 

for any algebra 21. The following problem (see [10], Problem 7.5) seems 
to be interesting and still open: 

Problem Does there exist a uniform method for constructing a set Q 
(valid for all algebras 21 ) such that equality ( 7 ) holds? 

4 The Galois connection related to weak 
automorphisms 

In this section we want to describe another Galois connection, this 
time a connection between bijections of a set A and clones over A. 
Let a G Sa and ® G £, where C — £(A) is the set of all clones over A 
(see Preliminaries). Then we define a relation 0 between elements of 
Sa and £{A) as follows: 



(J0B d(B) = B, 
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where a is a mapping induced by o (or in other words u is a weak 
automorphism of the algebra (A, B); see Preliminaries). We will say 
that a preserves the clone B. 

The relation 0 determines a Galois correspondence (see [13], [17], [18]) 
between subsets G C Sa and families T of clones over A, given by the 
two mappings: 



G ^ F(G) = {le C(A) | (Vct E G) [a 0 B]} , 

T ^G(T) = {a E S A | (VB eT) [a 0 B]} . 

Note some simple properties of this mappings (see [17], [18]), namely: 

(i) G({E}) = G({0(°)}) = G({©}) = Sa, where 0(°) is a set of all 
constant operations of the set A, 

(ii) E, O E f(G), 

(iii) G({B}) = WAut({A,B)), 

(iv) if G < Sa (he. G is a subgroup of Sa) and B E ^F(G), then 
G < WAut((A,B)), 

(v) if Bj G 3~{G) (for i G /), then ^JjBj^ £ .F(G) and more 
generally T{G) is a sublattice of the lattice of all clones over A, 

(vi) J-{G) = !F({G)) = P| -7 7 ({c r }), where (G) is a subgroup of Sa 

<reG 

generated by the set G C Sa, 

(vii) G{T) = n G({B}) < Sa. 

Define the operators V : 2 s A — > 2 s A and A : 2 C — > 2 C in the following 
way: 



V(G) = G(T(G)), 
A(T) = T(G(F)). 
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Of course, V and A are closure operators. 

In other terminology (see [24]), if G < Sa, then clones from F{G) are 
called G-dones and T{G) is the lattice of all G-clones. It follows from 
general facts on Galois-closed subrelations that are discussed in the 
paper [5] of this volume that the lattice of all G-clones is a complete 
sublattice of the lattice of all clones. 

From property (vii), we know that V(G) is a subgroup of Sa for every 
G C Sa- The question of whether the converse implication is true is 
very natural. So, sometimes ago we formulated the following ques- 
tions: 3 

Question A. Is it true that v(G) = G iff G < Sa ? 

This problem is equivalent to the next two problems: 

Question B (concrete characterization problem). Is it true 
that for every group G < Sa there exists a clone B E C{A) such that 
WAut((A,M)) = G? 

Question C (see [24]). Are the lattices of Gi-clones and G 2 -clones 
different for different groups G\,G 2 < SU? 

If the answer will be affirmative, then the lattice of lattices of G-clones 
would be dual to the lattice of subgroups of Sa- But recently we were 
informed by A. Szendrei that this is not the case. 4 

Example. Let O / B C 4 and let F be the set of all mappings 
fulfilling the following condition: 

f(xi , . . . , x n ) E B iff Xi E B for some 1 < i < n. 

The set T = F U E is a clone and T E T{G) iff B is invariant under all 
permutations from G < Sa- Then one can observe that if the subgroup 
G is maximal with respect to inclusion, having the set B as a union 
of orbits, then v(G) = G. 

Let £ be the identity permutation on set A. Using this example for 
A = {0, 1, 2} we can easily show that {e, (01)}, (e, (12)} and {e, (02)} 

3 4 see “Added in proof” at the end of this paper. 
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are V-closed. We know that there are infinitely many elements in 
and there are 48 elements in T{Sz) (see [24]) which shows us 
that A 3 is V-closed. Later on we will observe that {e} is V-closed. 
For solving this problem, another Galois connection is helpful, namely 
the connection between all Unitary operations of the set A and all 
relations defined on A (see [2] and [42]). Now, we recall some notions 
which we need in order to describe this Galois connection. 

Let / G (A) and p G (A). We say that the operation / pre- 
serves the relation g whenever 



(Varn, • • • , X \ m , X 21 , ■ ■ 



7 %2mj • • • 5 Xnl, * * * 5 % nm 



eA) 



[(V . 7 e {1, . . . , m}) ((xij, . . . , x nj ) g g) => 

((/ (xn, . . . , x lm ) , / (x 2 i, . . . , x 2 m ) (x n i, • • • , x nm )) e p)]. 
For FCO(4) and § C K(yl) let us define two operators 

Pol (S) := { / 6 0(^4) | / preserves each g e S}, 



Inv (F) := {p G M(^4) | each / G F preserves p} . 

These operators Pol and Inv form the Galois connection mentioned 
above (see [2], [42] and [46]); and for each F C 0(/l), Pol ( Inv (F)) = 
Loc({ F)) where 

Loc(F) = {/ G O^(A) | n > 1, (V finite# C A n ){3 g G W)[f\ B = 
9 \b ]} 

is the so-called local closure of the set F (see [46]). If the set A is finite 
then the Galois -closed sets of operations are exactly the clones. 

For a permutation a G Sa and a relation p G (A) let 

o (p) := { (<t (ai) , . . • , a (a n )) | (a x , . . . , a n ) G p} . 

It is known ([24]) that for S C R(A) , a ( Pol (S)) = Pol (a (S)). If the 
permutation a maps the set S onto itself, then so does a. If the set of 
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relations § is Galois-closed, then for the clone C = Pol (§) , a (C) = C 

iff ct(S) = S. 



Now, let us return to the example above. In order to show that the sub- 
group {e} < £3 is Galois-closed (V-closed with respect to the Galois 
connection related to weak automorphisms), one can easily show that 



the only permutation which maps the clone Pol 



0 1 2 
1 2 1 




7 r = 



0 1 2 
1 2 1 



is a binary relation in which elements are written as 



columns) onto itself, is the identity e. Indeed, let / be an operation of 
one variable which preserves the relation n, and let / (0) = 1. Then we 
must have / ( 1 ) = 2 and / (2) = 1. If a G S3, then let g = a (/), i.e. 
g{x) = a (/ (a -1 (x))). If o = (01) , then g (0) = 2 and g (1) = 0, and 
thus g does not preserve the relation 7r. Analogously we can show that 
the only permutation a e S3, for which g preserves 7 r, is the identity. 



The Galois connection considered in this section, between permuta- 
tions and clones, can be extended to a Galois connection between all 
unary mappings from A to A (i.e. elements of the set (A)) and 
clones, by redefining the relation 0 in any of the three following ways. 
Let Ta = A A ( = (A) ), r E Ta and B 6 £(A). We can define 0 

as 

(c) t 0 B r fulfills the condition (a) for the algebra (A; B) (see 
Preliminaries); 

(d) r01^r fulfills the condition (b) for the algebra (A; B) ; 

(e) t 0 B r is a weak endomorphism of the algebra (A; B) . 

Now, if H C Ta and T C£ (A) , then taking each of definitions (c), 
(d), (e) for the relation 0 and defining operators H , T as follows: 

H ^ P(H) = {Be £(A) | (Vr G H) [r 0 B]} , 

T ^H(P) = {t e T a I (VB eP) [r 0 B]} , 



one gets three different Galois connections. 

If we consider definition (d) and H C Ta is a subsemigroup of the 
semigroup Ta, then P(H) is the set of all H -clones (see [24]). 
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5 Some compatibilities and Galois connections 

In this section we will present a construction which generalizes the 
notions of weak automorphisms and preservation of a relation by an 
operation. Let A be a fixed non-empty set. For every g £ R^(A), 
i.e. g C A n , we can define an n-ary relation g on 0( TO )(A), i.e. g C 
(0( m )(A)) n as follows: 

(/i, ■■■,/n) e ^ iff 



(Varn , ■ ■ • , X\ m j *^21 j • • • j *^2 m > • • • > %nl j ■ ■ ■ > X nm £ -A) 



[(Vj e {!)•■■) m }) {{xij j • • • j %nj ) £ g) => 

((/l (*Dl j ■ • ■ j ^lm) i f'l (*^21 j • • • j %2m) •> • • • t fn (^nl j • • • ) ^nm)) G £*)] , 



i.e. 



( /i (xn, . . . ,x lm ) \ 

y fn • • • > Xnm) J 
£ g £ g 



£ g. 



This definition encompasses a number of special cases. An operation 
/ satisfies (/,...,/) £ g exactly when / preserves the relation g (see 
Section 4). In this way, for any set of relations S C M(A) and for any 
set of operations F C 0(4), we have 

Pol (S) = { / G O(A) | (Vg G S) [(/, •••,/) G 5]} and 
Inv(¥) = {g£R(A) \ (V/ 6 F) [(/,...,/) G . 

Let a £ Sa (or more generally, a £ A A ). Define g a as follows: 

(x,y) £ g G ^ a(x) = y. 



Then 

if, g) G & iff 0- (/ (xi, . . . , x n )) = g (a (xx) (x n )) , 

which means that a (/) = g. 

Such a situation can be generalized in the following way. 
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A pair of clones A, B G C(A) are ^-compatible (where g is a binary 
relation on A) if for every / G A there exists gGl such that (/, g) G 
g and for every g G B there exists / G A such that (/, g) G g. If 
A = B, then the pair A, B is po-compatible (for a G A A ) iff a is a 
weak endomorphism of 21 = (A, A) (or a weak automorphism, if a is 
a permutation of set A). 

Now, for i > 1, we define the relation 0, between elements of R(A) 
and C (A) as follows: 

£ ©j C iff ((/i, . . . , fi-i, fi, fi+u ■ • • , /n) G 0 & /i, . . . , fi-i, fi+i 

gC^/aC) &((V/i gC)(3/ 1 ,..., f n eC)) [(/!,..., 

fi- 1, fi j fi+ 1) ■ ■ ■ i fn) £ £>], 

where g G R(A) is a relation of arity > i and C G C{A). Thus the 
Galois connection related to weak automorphisms from Section 4 can 
be generalized in the following way: 

§ .F(S) = {Cg C(A) I (Vge S)[e©i C]}, 

T ^n(F) = {geR(A) I (VC e?)[g ©*€]}, 

where S C M(A), T C C(A) and i > 1. 

In particular, if S = { g a | a G G} for some group G < Sa of permu- 
tations and i = 2, then T{ S) is the lattice of all G-clones. 

Added in proof: 

The authors obtained an information from Agnes Szendrei that the 
answer to question A (and for two its equivalent forms, namely ques- 
tions B and C) is NO. The answer is finally due to S. Marchenkov 
([34]) and it is based on the papers [29], [30], [32], [33] (see also [51] 
in this volume). 

The authors would like to thank A. Szendrei for this important infor- 
mation. 
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A Survey of Clones Closed Under 
Conjugation * 
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A. Szendrei 



Abstract 

There is a Galois connection between the lattice of clones on a set A and 
the group of permutations on A that is determined by the relation that 
a permutation conjugates a clone onto itself. The Galois-closed sets on 
the clone side are the lattices Lq of all clones that are closed under con- 
jugation by all members of some permutation group G. In this paper we 
discuss the coarse structure of the lattice Lq when A is finite and G is 
a 2-homogeneous permutation group, describe Lq completely for the case 
when G is the group of all permutations, and discuss for which groups G 
the lattice Lq is finite. 

AMS Mathematics Subject Classification: Primary 08A05 , Secondary O8A4O , 
08 A3 5, 03B50 

Key words: Clone , Conjugation , Weak automorphism. 



1 Introduction 

Many important symmetries of algebras are not automorphisms. For 
example, inversion 7 : G ^ G, g ^ g~ x is an antiautomorphism of a 
group, and this is a type of symmetry that is not an automorphism 
(unless G is abelian). Transpose 7: M n ( R) — > M n ( K), A 1— >• A T is 
an antiautomorphism of a matrix ring, which is a symmetry that is 
not an automorphism (unless n = 1 ). Complementation 7: B — » B, 

*This material is based upon work supported by the Hungarian National Foun- 
dation for Scientific Research (OTKA) grants no. T 026243, T 034175, T 37877. 
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b i — ^ b in a Boolean algebra is a symmetry that is not an auto- 
morphism. If K is a field and a is an automorphism of K, then 
the coordinatewise action of a defines a symmetry 7: K n — »• K n , 
(k u ..., k n ) h- (a(k \), . . . , a(k n )) of the n-dimensional A-vector space 
K n that is not an automorphism (unless a = id). 

The kind of symmetry exhibited by these examples is the focus of this 
paper. For the general concept, let A be a set and 7 be a permutation 
of A. If / : A n —>■ A is an operation on A, we define the conjugate of 
/ by 7 to be 



7 /(Tl; • • ■ > x n) = 7/ (7 1 (*^ri)) • 

The conjugate of a set F of operations by 7 is T = { 7 / : / G F}, 
and the conjugate of an algebra A = (A; fi, / 2 , . . .) by 7 is the algebra 
7 A = (A; 7 /i, 7 / 2 , . . .). It is easy to see that 7 is an automorphism 
of A if and only if 7 /j = /* for all fundamental operations fi of A. 
The common feature of the symmetries mentioned in the preceding 
paragraph is that they are permutations 7 of the underlying set of an 
algebra A = (A; /1, / 2 , . . .) such that A and 7 A have the same term 
operations. Such a permutation 7 is called a weak automorphism of 
A. In other words, 7 is a weak automorphism of A if and only if the 
clone £ of term operations of A satisfies the equality 7 £ = £. 

For any fixed permutation 7 on a set A, conjugation by 7 yields an 
automorphism £ h -4 7 £ of the lattice of all clones on A. The con- 
dition 7 £ = £ defines a Galois connection between clones on A and 
permutations on A as follows: we assign to every set of clones the 
collection of all permutations that fix every clone in the set, and to 
every set of permutations the collection of all clones that are fixed by 
every permutation in the set. 1 It is easy to see that the Galois-closed 
sets of clones are complete sublattices of the lattice of all clones, while 
the Galois-closed sets of permutations are subgroups of the symmetric 
group on A. For a permutation group G, the members of the Galois- 
closed set {£ : 7 £ = £ for all 7 G G} of clones corresponding to G 
will be called G-closed clones. It is easy to see from the definitions 
that the family of G-closed clones includes all clones £ that contain 
G, and also all clones £ for which G is a group of automorphisms of 
the algebra (A; £}. 



1 This Galois connection is discussed in detail in the paper [8] in this volume. 
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Our aim in this paper is to give an overview of some results on the 
lattice of G-closed clones on a finite set when G is a large permuta- 
tion group. In Section 3 we discuss Szabo’s theorem [38] which can be 
used to determine the coarse structure of the lattice of G'-closed clones, 
including all coatoms of the lattice, when G is a 2-homogeneous per- 
mutation group (a property slightly weaker than 2-transitivity). In 
Section 4 we present a complete description of all c? 4 -closed clones 
where <S^ is the symmetric group on A. This description is a modified 
version of results obtained earlier by Hoa [15] and Marchenkov [28]. In 
Section 5 we briefly discuss a recent result [20] which lists all groups 
G such that the family of G-closed clones that contain all constants 
is finite. For each major theorem mentioned above we outline a proof 
which, in some cases, differs essentially from the proofs published ear- 
lier. Our goal is to emphasize the advantages of combining different 
methods: using operational and relational arguments simultaneously, 
which is facilitated by the Galois connection between operations and 
relations 2 , and applying localization and globalization from tame con- 
gruence theory in combination with these arguments. 

Finiteness of the base set is crucial in these results and methods. For 
the case when the base set A is infinite, Goldstern and Shelah [10] 
obtained interesting results on a special class of ^-closed clones. They 
studied the lattice of all clones that contain the full transformation 
monoid on A, and found that the structure of this lattice depends 
heavily on partition (Ramsey) properties of the cardinality |A| of the 
base set. Specifically, if | A | is a weakly compact cardinal, then this 
lattice has exactly 2 coatoms, while if |^4| is, say, the successor of an 
uncountable regular cardinal, then this lattice has 2 2,A| coatoms. 



2 Preliminaries and examples 

For any set A, a done on A is a collection of Unitary operations on 
A that contains the projection operations and is closed under compo- 
sition. For any set F of operations on A, [F] will denote the clone 
generated by F. In other words, [F] is the collection of all term op- 
erations of the algebra A = (A;F), which is called the clone of A, 
and is also denoted by Clo(A). If A = (A; F) is an algebra, then A c 

2 For more details about this Galois connection, see [34] in this volume. 
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will denote the algebra that arises from A by adding all constants as 
fundamental operations. The clone of A c , that is, the clone generated 
by F and all constants, is called the clone of polynomial operations of 
A, and is denoted by Pol(A). For n > 1, Clo„(A) and Pol„(A) will 
denote the set of n-ary term operations, and the set of n-ary polyno- 
mial operations of A, respectively. A finite algebra A is called primal 
if Clo(A) is the clone of all operations on A, and functionally com- 
plete if A c is primal. We will call A a projection algebra if Clo(A) is 
the clone of projections. Two algebras are said to be term equivalent 
if they have the same clones, and they are said to be polynomially 
equivalent if they have the same clones of polynomial operations. 

An operation / on A is idempotent if it satisfies the identity f(x,...,x) 
= x. An algebra A is idempotent if all fundamental operations (and 
hence all term operations) of A are idempotent. A clone £ is idempo- 
tent if all operations in £ are idempotent. A ternary operation / is a 
Mal’tsev operation if it satisfies the identities f(x, y, y) = f(y , y, x) — 
x] f is a minority operation if it satisfies the identities f(x,y,y) = 
f(y,x,y ) = f(y,y,x) = X] and / is a majority operation if it satisfies 
the identities f(x, y, y) = f(y, x, y) = f(y , y, x) = y. 

We will use the notation Ta,Ca, Sa, and A a for the full transformation 
monoid, its subsemigroup of all constants, the symmetric group, and 
the alternating group on A, respectively. For |A| =4, Va will denote 
the Klein group on A. 

Let 7 E Sa- For an n-ary operation f on A the conjugate of / by 7 is 
the operation 



J f( x l,---, X n ) — 7/ (7 

The conjugate of a set F of operations by 7 is defined as W = { 7 / : 
/ E F}. It is straightforward to check that 7 £ is a clone for every 
clone £ and permutation 7, and the mapping £ i — > 7 £ defines an 
inner automorphism of the lattice of clones on A for every 7 E Sa- 
If the clone of an algebra A is fixed by this automorphism, that is, 
if 7 Clo(A) = Clo(A) holds, then 7 is called a weak automorphism 
of A (see [8] or [9]). The weak automorphisms of A form a group, 
which is called the weak automorphism group of A, and is denoted by 
WAut(A). It follows immediately from the definitions that for any 
algebra A, the automorphisms of A as well as the permutations g 
with g, g~ l E Cloi(A) are weak automorphisms of A. In fact, it is 
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clone of all operations 




Fig. 1: S A -closed clones on A = {0, 1} 



easy to check that the automorphism group Aut(A) of A is a normal 
subgroup of the weak automorphism group WAut(A), and so is the 
largest subgroup {g G S A : g, g~ l € Cloi(A)} of Cloi(A). If A is 
finite, then the latter group is fl Cloi(A). 

For a group G C S A and a set F of operations on A we will say that 
F is G-closed if 7 F = F for all 7 G G. In particular, it follows that 
a clone C on A is G-closed if and only if it is fixed by every inner 
automorphism of the clone lattice that is induced by a permutation 
from G, or equivalently, <£ is G-closed if and only if G is a subgroup 
of the weak automorphism group of the algebra (A; <£). Clearly, if G 
is the one-element group, then every clone is G-closed. 

2.1 Example. On a 2-element set A = {0, 1 } the only nontrivial 
group is G = S A , and the 5^-closed clones can be easily determined, 
using Post’s description [36] of all clones. Figure 1 shows the lattice of 
all these clones. In the diagram 0, 1, A, V, “ denote the Boolean algebra 
operations, p(x, y,z) = x + y + z is the unique minority operation, and 
m(x, y, z) = (x A y) V (x A z) V (yAz) is the unique majority operation 
on A = {0, 1}. 

From now on we will assume that A is a finite set with at least three 
elements, and A is the underlying set of all algebras and clones consid- 
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ered. A relation g C A m is called a compatible relation of an algebra 
A = (A; F) if g is (the underlying set) of a subalgebra of the m-th di- 
rect power A m of A. Examples of compatible binary relations include 
congruences and the graphs of automorphisms. (The graph of an n-ary 
partial function h is defined to be the (n + l)-ary relation consisting 
of all tuples (ai, . . . , a n , h(ai, . . . , a n )) such that (ai, . . . , a n ) is in the 
domain of h.) If g is a compatible relation of the algebra (A; /), we 
will also say that f preserves g. 

It is well known (see e.g. [34, 35]) that for finite A, clones on A can 
be characterized as the Galois-closed sets of operations in the Galois 
connection between operations on A and relations on A that is defined 
by the condition “/ preserves g" . This Galois connection assigns to 
every set of operations the collection of all relations that are preserved 
by every operation in the set, and to every set of relations the collection 
of all operations that preserve every relation in the set. It is easy to 
see that Galois-closed sets of operations are clones. The fact that all 
clones are in fact Galois-closed provided the base set is finite, can be 
expressed as follows. 

2.2 Claim. [34, 35] On a finite base set A, an operation f belongs 
to a clone £ if and only if f preserves all compatible relations of the 
algebra (A; (£). 

For any fixed permutation group G C Sa one can modify this Galois 
connection so that the Galois-closed sets of operations are exactly the 
G-closed clones (cf. e.g. [11]). For any 7 G Sa and any relation g C A m 
let 

7 0 = i{q) = {(7(^1), • • • , l{x m )) : (x u . . . , x m ) G 0}. 

Since 7 / preserves g for all 7 G G if and only if / preserves 7 g for 
all 7 6 G, these two conditions define the same Galois connection, 
and it follows that the Galois-closed sets of operations in this Galois 
connection are exactly the G-closed clones. In other words: 

2.3 Claim. [11] On a finite base set A, either one of the following 
conditions characterizes G-closed clones £: 

(1) 7 f £ £ for every f £ £ and 7 G G; 

(2) 7 £> is a compatible relation of the algebra A = (A; (£) for every 
compatible relation g of A and for every 7 G G. 




